speech-recognition

Speech Recognition.

I need to develop an iPhone application which recognizes speech, and based on the result it performs further tasks. I know iPhone 3.0 doesn't support speech recognition and I need to implement speech recognition software on the server side. I know this thing only, since I am newbie I don't know how to deal with that. Mean Which softwar...

Speech recognition with file as input

Possible Duplicate: speech recognition from audio file instead of microphone How can someone perform speech recognition with a file (.wav or .mp3) as input and using Delphi ? I want to use an audio file as input rather than the microphone. ...

Voice Command over the Web

I'm just interested in this scenario. Suppose you are building a web application, and would like to receive speech commands and execute actions based on the speech-to-text result. Any major roadblock that may otherwise make this a killer application? ...

Different levels in speech recognition software.

There are phonetic level, syntactic level, semantic level, phonological level, acoustic level, linguistic level, language level. Are there any other levels? What's the order from the bottom up? And what are they really about? ...

How do I parameterize android speech recognition? android.speech.action.RECOGNIZE_SPEECH does not do anything

How to I properly parameterize the speech recognizer so that it can more easily recognize the words that the user will probably say? As far as I can tell there is no way. I believe this site is wrong: http://www.4feets.com/2009/04/speech-recognition-in-android-sdk-15/ According to the site, this code: ArrayList< String > potential...

Mapping ISpMMSysAudio::SetLineId and mixerLineId

I want to set the lineId for SAPI by calling ISpMMSysAudio::SetLineId. I find out the line which I wish to set from the mixer handle I have. However the line id I get from the mixer is not what SAPI assumes. It results in SAPI listening to the wrong input line for mic thus getting no input. I'm getting the mux control for "DST_WAVEIN" f...

Speech recognition at MAC and PC

I have to make speech recogintion application that should work on MAC and PC, i have a idea about SAPI but it will for only for PC not for MAC. Is there any other API or frame work i can use to achieve this, programing language not bar for me. ...

How to get the correction commands while doing free dictation with sapi 5.3?

I have an sample application that do basic dictation with sapi 5.3. It works well to input text with punctuation... What would I need to do to enable the correction functionality as in the tutorial (correct word, select ...)? Is there a specific grammar to load (which and how), some specific events to implement? ...

Other language support in Sphinx speech recognizer

I am analyzing Sphinx speech recognizer for one of our project. Requirement is to covert speech to text with support for 6 languages including English (US). Does someone have any idea whether this is possible in Sphinx or not? The reason we have chosen Sphinx is that we want the application compatible with Windows, Mac and Linux platfor...

Sphinx II Recognition Read and Recognize

Hi, I'm trying to implement a speech recognizer client using Sphinx II, my goal is to have the someone reading a written sentence in a text and have a confidence score for each word. So for that I'm using Finite State Grammars and limited dictionary. I got something working, but the confidence score are very low (< 0.30). I'm using t...

Loading multiple grammar files in SAPI 5.1

I'm using an XML grammar file to develop a Command and Control application. Since there are around 4000 entries in the grammar I can't have it all in one file (it gives an error when I try to load it). I have written multiple XML grammar files, but when I try to load more than one file it replaces the previously loaded file. that is the ...

Speech Recognition & Programming

Has anyone had success with Dragon Naturally Speaking voice recognition software when it comes to programming? I am wondering because I think it would be a lot faster than me typing by hand, and easier on my carpol-tunnel. I program from day to day in visual basic 6 ide, visual studio 2008 ide + team explorer, writing emails, and cha...

Getting WAV file transcription to work with Sphinx4

Hi, I've got Sphinx-4 installed on my windows XP system and JSAPI set up. I'd like to transcribe an English spoken WAV (or MP3) file to text. When I run the "WavFile" demo - it runs successfully. java -jar WavFile.jar But, when I pass my own wav file like this: java -jar WavFile.jar c:\test.wav I get: Loading Recognizer...

Need help with Windows Speech Recognition script to compound a string

The command below for Windows Speech Recognition forces the speech recognition engine to use literal text (""all seventy six people paid five dollars") instead of the default ("all 76 people paid $5"). I'm trying to adapt this command to remove the spaces between words so that I could program using speech by saying things like: "Securi...

How to infer debug information from Alternates Panel output in WSR

I have this Windows Speech Recognition (WSRMacro) script which compounds multiple words that are spoken into a single word: "Happy children" -> "Happychildren" However, a bug in the script appears under certain circumstances and I do not know how to deduce what the problem is. Although the above example works, the following does not: ...

Custom vocabularies for Speech Recognition in Windows

Using Windows7 speech recognition I wish to create specialised vocabularies for recognising a domain-specific natural language in parts of my application. Thus, for example, a specific text entry box could be linked to its own vocabulary and limited to a small subset of language, e.g. throw the axe at the troll where "troll" is in the...

What is the best speech recognition library for Java Micro Edition?

What is the best speech recognition library for jme? ...

Connecting to Naturally Speaking through SAPI 4 interface, possible?

I read somewhere that you can interface to Naturally Speaking through the SAPI 4 interface it exports? Is this true and is there any documentation? Thanks ...

Voice recognition in J2ME

I'm developing a mobile application using j2me. There I need to have a speech recognition function, so that application should be able to process and act upon the commands given by the user. What I wanted to know is Is this technically possible (I'm a novice to j2me programming)? If it is possible, where can I find a j2me library for s...

Is there software that outputs speech-to-text at the Phonological level?

Is there any software out there capable of taking audio files and outputting phonological (IPA) text? I understand much of the software out there takes it straight to a language, but is there one that is 'teachable'? ...