speech-recognition

How to get started with speech-to-text?

I'm really interested in speech-to-text algorithms, but I'm not sure where to start studying up on them. A bunch of searching around led me to this, but it's from 1996 and I'm fairly certain that there have been improvements since then. Does anyone who has any experience with this sort of stuff have any recommendations for reading / sou...

How do I search content, within audio files/streams?

I have always wondered how many different search techniques existed, for searching text, for searching images and even for videos. However, I have never come across a solution that searched for content within audio files. For example: Let us assume that I have about 200 podcasts downloaded to my PC in the form of mp3, wav and ogg file...

Anyone have experience with Sphinx speech recognition?

Has anyone used the Sphinx speech recognition stack to build IVR applications? I am looking for open source alternatives to the expensive and somewhat limiting choices from MSFT and others. I have not been able to find a comprehensive package that ties open source speech/voip applications together. ...

What's a good open source VoiceXML implementation?

I am trying to find out if it's possible to build a complete IVR application by cobbling together parts from open source projects. Is anyone using a non-commercial VoiceXML implementation to build speech-enabled systems? ...

Vista speech recognition in multiple languages

Hi, my primary language is Spanish, but I use all my software in English, including Windows; however, I'd like to use speech recognition in Spanish. Do you know if there's a way to use Vista's speech recognition in a language other than the primary OS language? ...

Speech Recognition for Searching Files

Here is the problem I have: I have a lot (tens of thousands) of mp3 files that my users would like to be able to search. Is there software out there that you've used or heard good things about that would allow me to index that content and put it in a database so I can search on it later? ...

Voice Recognition Software For Developers

Well, the doc's finally said it: I need to take it easy on my wrist for a few months. Since I'm a .NET developer, this could end my livelihood for a little while, something I'm not anxious to do. That said, are there any good hands-free options for developers? Has anyone had success using any of the speech recognition software out there?...

What are the techniques for word recognition in a sound stream?

I'm quite curious about this. In a broad way, how does one go about doing the following: Detection of word separations. Detection of syllables. Compensate for normal speech word connections. ...

C# Speech Recognition - Is this what the user said?

I need to write an application which uses a speech recognition engine -- either the built-in Vista one, or a third-party one -- that can display a word or phrase, and recognise when the user reads it (or an approximation of it). I also need to be able to switch quickly between languages, without changing the language of the operati...

Question SpeechSynthesizer.SetOutputToAudioStream audio format problem

Hi, I'm currently working on an application which requires transmission of speech encoded to a specific audio format. System.Speech.AudioFormat.SpeechAudioFormatInfo synthFormat = new System.Speech.AudioFormat.SpeechAudioFormatInfo(System.Speech.AudioFormat.EncodingFormat.Pcm, 8000, ...

Vista Speech Recognition in Delphi

I would like to be able to dictate into my Delphi application using Windows Vista's speech recognition. However, when I attempt to dictate into a TMemo, it simply does not work. I noticed that Firefox 3.0 has the same issue, and after its developers contacted Microsoft about the problem, Microsoft told them that they need to implement the Text Service...

Acoustic training using SAPI 5.3 Speech API

Using Microsoft's SAPI 5.3 Speech API on Vista, how do you programmatically do acoustic model training of a RecoProfile? More concretely, if you have a text file, and an audio file of a user speaking that text, what sequence of SAPI calls would you make to train the user's profile using that text and audio? ...

How to add words to an already loaded grammar using System.Speech and SAPI 5.3

Given the following code, Choices choices = new Choices(); choices.Add(new GrammarBuilder(new SemanticResultValue("product", "<product/>"))); GrammarBuilder builder = new GrammarBuilder(); builder.Append(new SemanticResultKey("options", choices.ToGrammarBuilder())); Grammar grammar = new Grammar(builder) { Name = Constants.GrammarName...

Can I write SQL using speech recognition?

I have wrist pain when I type, and I would like to start writing SQL statements, stored procedures, and views using speech recognition. ...

Spoken Word Programming Language / System

I recently came upon something called Rpoku, which is a spoken computer language. I also found some research called Voice Code, which takes a different approach. There's a demo video here. Does this look promising, or is it just a toy? Is it, or will it be, possible to create applications some day without typing at a keyboard? ...

Can I use the Vista speech API in Windows Server 2003?

I have written a small app on the iPhone that transfers 30 seconds of speech to a web server that calls a Windows service via remoting; the service uses SAPI 5.1 on Windows Server 2003 to convert the wave to text via the default recognizer. I want to update the engine to the latest and add a medical speech model that was created to run ...

Speech to text conversion in Linux

I am planning to start an application which converts speech to text in Linux. Are there any existing interfaces that I can extend? Or is there any such existing application in Linux? Any inputs on this? EDIT: The application that I am planning to write should be able to convert every word that we speak to text, not just the Yes...

CMU Sphinx Live Decoder

I've been doing some experiments on sphinx-3.5. The batch decoder works great; however, when I try the same model with the live decoder, results get very poor. I understand that the live decoder will surely use a faster search algorithm, but I still think there should be a way to improve it by playing with its parameters. My probl...

How to split male and female voices from an audio file (in C++ or Java)

I want to differentiate between the male and female voices in an audio file and separate them. As output, I want the two voices separated. Can you please help me out, and can the coding be done in Java or C++? ...

Java voice recognition

Does anyone have experience with any open source, or relatively cheap, voice recognition API for Java? I'm pretty much looking for something that will turn spoken words into text. Judging from the Java speech recognition page on Sun's site, it seems to be rather dead. My requirement is something that at the least runs on...