项目作者: technicianted

项目描述 :
Library for Microsoft Cognitive Services speech recognition
高级语言: C
项目地址: git://github.com/technicianted/libmsspeech.git
创建时间: 2017-05-19T05:07:29Z
项目社区:https://github.com/technicianted/libmsspeech

开源协议:Other

下载


libmsspeech

Library for Microsoft Cognitive Services speech recognition. For more details about usage, take a look at my blog post.

This is the very first version that works. Do not use it in any serious application yet!

Prerequisites

Building

  1. autoreconf --force --install
  2. ./configure
  3. make

Using

Start by running exampleProgram to learn how to use the library:

  1. Usage: exampleProgram [OPTION...] <key> <language>
  2. -d Produce debug output.
  3. -f FILE Audio input file, stdin if omitted.
  4. -m MODE Recognition mode:
  5. -p MODE Set profanity handling mode {raw|masked|removed}. Default is masked.
  6. {interactive|dictation|conversation}. Default is interactive.
  7. -t Request detailed recognition output.

To recognize a file:

  1. exampleProgram -f <path to wav> -m interactive <your subscription key> en-us

On Linux, you can stream audio directly from microphone using Debian alsa-utils:

  1. arecord -c 1 -r 16000 -f S16_LE | ./exampleProgram -m interactive <your subscription key> en-us

or perform long dictation on Steve Jobs Standford University commencement speech:

  1. curl -L -s https://archive.org/download/SteveJobsSpeechAtStanfordUniversity/SteveJobsSpeech_64kb.mp3 | \
  2. mpg123 -w - -m -r 16000 -e s16 - | \
  3. ./exampleProgram -m dictation <your subscription key> en-us

More explanation and details on how to use the library can be found in this blog post.