List of speech recognition software
This is a list of speech recognition software.
Open source acoustic models and speech corpus (compilation)
The following list presents notable speech recognition software engines with a brief synopsis of characteristics.
Application name | Description | Open Source | License | Operating System | Programming Language | Supported Language/Note | Offline vs Online |
---|---|---|---|---|---|---|---|
CMU Sphinx | HMM | Yes | BSD style | Multi-platform | C/Java/Python | English | Offline |
HTK | HMM neural net. | No | HTK Specific License | Multi-platform | C | English. Version 3.5 released December 2015. | |
Julius | HMM trigrams | Yes | BSD-like | Multi-platform | C | Japanese, English(non-commercial)[1] | Offline |
Kaldi | Neural net | Yes | Apache | Multi-platform | C++ | English | |
RWTH ASR | RWTH Aachen University | No | RWTH ASR License | Linux, macOS | C++ | English. Non-commercial use only |
The following lists open-source applications that provide convenient user interfaces for the above.
Application name | Description | Open Source | License | Operating System | Programming Language | Supported Language/Note |
---|---|---|---|---|---|---|
Simon | Supports Sphinx, HTK, Julius | Yes | GPLv2 | Multi-platform | C++ | English |
Jasper project | Raspberry Pi front-end for CMU Sphinx or Julius | Yes | MIT License | Linux | Python | English |
Macintosh
Application name | Description | Open Source | License | Price | Note |
---|---|---|---|---|---|
Dragon Dictate | Mac OS (by Nuance) | No | Proprietary | ||
MacSpeech Scribe | Transcription from recorded text | ||||
iListen | PowerPC Macintosh | ||||
Speakable items | Included with Mac OS | ||||
ViaVoice | IBM Product. Purchased by Nuance. | ||||
Voice Navigator | Original GUI voice control (1989) |
Cross-platform web apps based on Chrome
The following list presents notable speech recognition software that operate in a Chrome browser as web apps. They make use of HTML5 Web-Speech-API.[1]
Application name | Description | Open Source | License | Price | Note |
---|---|---|---|---|---|
Sonix | Independently reviewed as most accurate automated transcription service | No | Commercial | 10¢/minute w/ subscription | |
Happy Scribe[2] | Cloud based transcription service | No | Commercial | From 10¢/minute | |
Voice Notebook | Free dictation, voice typing to clipboard and to web text fields. Windows and Linux integration | No | Commercial | Free | |
SpeechTexter[3] | Online speech recognition and editor | No | Commercial | Free | |
Speechnotes | Dictation notepad - professional speech recognizing text editor web app | No | Commercial | Free | |
Go-Transcribe.com | Cloud based transcription service | No | Commercial | Subscription based | |
Speechmatics[4] | Cloud based and on-premise automatic speech recognition | No | Commercial | From £0.06 per minute of audio | |
Trint | Convert audio/video to text, search and verify in online editor that glues audio to text. | No | Commercial | From 17¢/minute |
Mobile devices and smartphones
Many cell-phone handsets have basic dial-by-voice features built in. Smartphones such as iPhones and BlackBerrys also support this. A number of third-party apps have implemented natural-language speech-recognition support, including:
WindowsWindows built-in speech recognitionThe Windows Speech Recognition version 8.0 by Microsoft comes built into Windows Vista, Windows 7 and Windows 8. Speech Recognition is available only in English, French, Spanish, German, Japanese, Simplified Chinese, and Traditional Chinese and only in the corresponding version of Windows; meaning you cannot use the speech recognition engine in one language if you use a version of Windows in another language. Windows 7 Ultimate and Windows 8 Pro allow you to change the system language, and therefore change which speech engine is available. Windows Speech Recognition evolved into Cortana (software), a personal assistant found in Windows 10. Add-ons for Windows 7 speech recognition
Windows 7/8/10 third-party speech recognition
Windows XP or 2000 only
Built-in software
Interactive voice responseThe following are IVR/Interactive Voice response systems:
Unix-like x86 and x86_64 speech transcription software
Discontinued software
See alsoReferences
|