I'd like to build an app that uses voice recognition. I've seen big companies like Google etc implement this feature, but I'm curious about doing it on a start-up level. Anyone looked into this? Are there any tools out there for us to do this?
+4
A:
The best approach will probably be to:
- Record the voice on the phone
- Send the recording to a server that runs the speech recognition software
- Then return something to the phone to indicate what it should do
John Meagher
2009-06-02 23:26:03
That's a lot of data to send. I might try it on the iPhone itself. After all, PCs could do a fair job of this 10 years ago, so perhaps iPhones should be able to now.
Nosredna
2009-06-02 23:28:21
This is actually the technique the Google Search app uses
PiPeep
2009-07-08 20:28:25
Google encodes the voice in a special way, they don't just send the raw audio data for exactly the reason Nosredna gave.
Alexander Kellett
2010-07-07 11:48:52
+5
A:
If you start here at wikipedia, you'll get a good list engines (http://en.wikipedia.org/wiki/Speech_recognition#Commercial_software.2Fmiddleware)
As I write this (June 24, 2009) it looks to me that are two viable open source solutions
- Pocket Sphinx (http://www.speech.cs.cmu.edu/pocketsphinx/)
- Julius (http://en.wikipedia.org/wiki/Julius_(software))
Both have been used in iphone apps, but the iphone friendly source isn't readily available.
As I edit this (8 July, 2009) I recently learned that Loquendo (http://www.loquendo.com/en/) has voice recognition and speech synthesis (ASR & TTS) for the iphone.
JJ Rohrer
2009-06-24 20:04:09
@Rohrer, Will Apple approve this if we add any external engines for the voice recognition
Shibin Moideen
2010-08-19 08:29:03
@Shibin - I've never heard of such apps being rejected, and I wouldn't expect them to be, either, but your mileage may vary. You can always search around for users of a particular sdk and make sure their apps are actually being published. This would be particularly easy with the commercial sdks.
JJ Rohrer
2010-08-23 13:14:08
A:
Here is a commercial SDK for doing so: http://www.creaceed.com/ceedvocalsdk/
Looks like a wrapper for Julius.
Alexander Kellett
2010-03-08 22:50:03