ansaurus

Question

Mac OS X speech-to-text API: how to?

Answer 1

A:

Here's a good O'Reilly article to get you started.

Charlie Martin 2009-05-07 23:47:30

Thanks, Charlie. Do you have a code example?

Roy Chan 2009-05-07 23:50:05

Answer 2

+2 A:

There are a number of examples that get copied to /Developer/Examples/Speech/Recognition when you install Xcode.

The Cocoa class for speech recognition is NSSpeechRecognizer. I've not used it, but as far as I know speech recognition requires you to build a grammar so the engine can pick from a fixed set of choices, rather than letting you pass free-form input. This is all explained in the examples referred to above.
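
As a rough illustration of the command-list approach described above, here is a minimal sketch using NSSpeechRecognizer. It is not taken from the Xcode examples. The class name CommandListener, the helper DemoCommands, and the three phrases are made up for illustration, and the code assumes ARC and a reasonably recent macOS SDK (the delegate signature was looser in 2009-era SDKs).

```objc
#import <Cocoa/Cocoa.h>

// Hypothetical helper: the fixed phrases the recognizer is allowed to match.
static NSArray<NSString *> *DemoCommands(void) {
    return @[@"open file", @"close window", @"stop listening"];
}

// Hypothetical class that owns a recognizer and receives its callbacks.
@interface CommandListener : NSObject <NSSpeechRecognizerDelegate>
@property (strong) NSSpeechRecognizer *recognizer;
- (BOOL)start;
@end

@implementation CommandListener

- (BOOL)start {
    // init returns nil when speech recognition is unavailable.
    self.recognizer = [[NSSpeechRecognizer alloc] init];
    if (self.recognizer == nil) {
        return NO;
    }
    // Only phrases in this list can be recognized; there is no free-form input.
    self.recognizer.commands = DemoCommands();
    // Keep listening even when the application is not frontmost.
    self.recognizer.listensInForegroundOnly = NO;
    self.recognizer.delegate = self;
    [self.recognizer startListening];
    return YES;
}

// Called on the main run loop each time one of the commands is heard.
- (void)speechRecognizer:(NSSpeechRecognizer *)sender
     didRecognizeCommand:(NSString *)command {
    NSLog(@"Recognized: %@", command);
    if ([command isEqualToString:@"stop listening"]) {
        [sender stopListening];
    }
}

@end
```

To try it, keep a strong reference to a CommandListener and call start from something like applicationDidFinishLaunching:. The delegate callback only fires while the application's run loop is running.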

diciu 2009-05-08 19:03:28

Answer 3

+3 A:

This comes a bit late perhaps, but I'll chime in anyway.

The speech recognition facilities in OS X (on both the Carbon and Cocoa sides) are for speech command recognition, which means they only recognize words (or phrases, or commands) that have been loaded into the speech system's language model. I've done some work with small dictionaries and it works pretty well, but if you want to recognize arbitrary speech, things may get hairier.

Something else to keep in mind is that the Carbon and Cocoa speech APIs in OS X do not map one to one. The Carbon API provides functionality that has not made it into NSSpeechRecognizer (the docs make some mention of this).

I don't know about Cocoa, but the Carbon Speech Recognition Manager does allow you to specify inputs other than a microphone, so a sound stream would work just fine.

Latrokles 2009-11-03 05:07:21

Answer 4

A:

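You can use either the Speech Synthesis Manager in ApplicationServices (10.0+):

// SpeakString takes a Pascal string (Str255), so convert from a CFString first.
CFStringRef cfstr = CFStringCreateWithCString(NULL, "Hello World!", kCFStringEncodingMacRoman);
Str255 pstr;
CFStringGetPascalString(cfstr, pstr, sizeof(pstr), kCFStringEncodingMacRoman);
CFRelease(cfstr);  // we created cfstr, so we must release it
SpeakString(pstr); // returns an OSErr; check it in real code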

or AppKit's NSSpeechSynthesizer (10.3+)

NSSpeechSynthesizer *synth = [[NSSpeechSynthesizer alloc] initWithVoice:@"com.apple.speech.synthesis.voice.Alex"];
[synth startSpeakingString:@"Hello world!"];

valexa 2010-07-07 12:58:51

That's for synthesizing speech (text to speech), not recognizing speech (speech to text).

Peter Hosey 2010-07-07 19:34:56

It looks like I meant this reply for a different question, and now I can't find it.

valexa 2010-07-07 20:23:25
