speech

Can anyone recommend a decent DSP/speech library in C++?

Google returns too much results, although SPUC caught my attention. Is there a standard recommended library like OpenCV for vision? The necessary features would be: Free Open Source filter design (Butterworth, Chebyshev, etc) FFT if possible, some speech processing features, like MFCC computation, although that's secondary, as I could ...

how to use Speech SDK 5.1

hi frnds, i recently downloaded Microsoft Speech SDk 5.1 and i want to learn that how to use it. Please suggest me some websites and links where i can find tutorials on using it from basic using C# or C++. thanx with regards :) ...

Speechlib on Shared hosting - ASP.NET

I am trying to use SpeechLib on my personal website. It's a very simple app that saves some text to a wav file - standard stuff. Works great on the dev machine. But all hell breaks loose when I deploy it to the shared host. Sometimes I get prompted for user name and password at the time of writing the wav file. Sometimes, I get the "Sec...

Does Naturally Speaking have a "show numbers" command like WSR?

Why doesn't Naturally Speaking have this command? You would think they would have released it by now. WSR is much better because of this. ...

how to generate sphinx speech phonems and map them to a word

Hi, I want to develop a speech recognition engine in java using sphinx.My language is not english.Using an english language model produces unexpected results.I wanted to know if its possible to generate a list of phonems for my own language and use it map words in a dictionary.I dont know where to begin with this.Do i have to record pron...

How to turn pcm audio into text using some lib written entirely in the C\C++ programming language?

How to turn pcm audio into text using some lib written entirely in the C\C++ programming language? So I have pcm file. I want to turn it into text. how to do it? (with speech recognizer lib of your choise (BTW i need it to work extreamly fast) So what do I need? Open Source Libs. Tutorials and blog articles on How to do/use it. ...

speaker dependent speech recognition engin with sdk

I want to do a little apllication, does any one know of a good speaker dependent speech recognition engin with sdk. (not speech to text engins) thank you, Efrat ...

Reading Source Code Aloud

After seeing this question, I got to thinking about the various challenges that blind programmers face, and how some of them are applicable even to sighted programmers. Particularly, the problem of reading source code aloud gives me pause. I have been programming for most of my life, and I frequently tutor fellow students in programming,...

Vector Quantization in Speech Processing Explanation

Hi all, I'm having trouble determining from this research paper exactly how I can reproduce the Standard Vector Quantization algorithm to determine the language of an unidentified speech input, based on a training set of data. Here's some basic info: Abstract info Language recognition (e.g. Japanese, English, German, etc) using acoustic...

Searching a database using Speech Server 2007

Hi, I am planning on building a telephone speech application with Microsoft Speech Server (now Office Communications Server 2007 R2). Before I get started I have been trying to find some sample code or tutorials on how to search for some recognised text which would be held in a database. The classic example is a telephone directory (I ...

Speech recognision in vb.net

Is Possiple to create Speech recognision in vb.net . Anybody tell to me thanks ...

I need an interesting computer science topic

I'm competing in a public speaking competition. It is about communicating scientific ideas to non-scientific audience in only three minutes. What computer science topic might make an interesting 3-minute presentation? The competition is famelab ...

Which Audio Codec should I use?

Which audio codec should I choose for my C# software development project where the raw audio to be transcoded/transported/used is based on the following: For encoding Speech only Audio can be stereo or mono Able to support live streaming Good tradeoff between file size and quality To be transported over TCP/IP in its encoded form Can b...

System.Speech and Voices

is it possible to create portable applications from Scansoft voices .exe files ? (www.portableapps.com). and then be able to access the voices programmatically through System.Speech namespace in .NET 3.5 ? I want to do that so I don't have to take a dedicated server just to install my text-to-speech web app. Help ! ...

How to use SAPI's SetNotifyCallbackFunction() in a CLR project with Windows Form as the interface window?

Hi, I'm trying to write a dll plugin for Winamp. I'm using Microsoft Visual Studio 2008 and Microsoft SAPI 5.1. I created the interface window using Windows Form (System::Windows::Forms::Form). I tried to use SetNotifyWIndowMessage(), but the method is never called when I speak to the microphone. So I tried using SetNotifyCallbackFunct...

Microsoft Agent Speech Recognition

Hi folk ; i've been working on project where i can make Microsoft Agent Read a Text that i wrote , but i wonder can i make Microsoft Agent Recognize Speech or don't bother trying . ...

Recognizing individual voices

I plan to write a conversation analysis software, which will recognize the individual speakers, their pitch and intensity. Pitch and intensity are somewhat straightforward (pitch via autocorrelation). How would I go about recognizing individual speakers, so I can record his/her features? Will storing some heuristics for each speaker's ...

Pocket Sphinx on Android

What are the steps required to use PocketSphinx on Android? I have found various hints around the web that it is possible, but not real answers and hence I ask my question here. ...

Dragon NaturallySpeaking Programmers

Is there anyway to encorporate Dragon NaturallySpeaking into an event driven program? My boss would really like it if I used DNS to record user voice input without writing it to the screen and saving it directly to XML. I've been doing research for several days now and I can not see a way for this to happen without the (really expensive)...

What is the difference between System.Speech.Recognition and Microsoft.Speech.Recognition?

There are two similar namespaces and assemblies for speech recognition in .NET. I’m trying to understand the differences and when it is appropriate to use one or the other. There is System.Speech.Recognition from the assembly System.Speech (in System.Speech.dll). System.Speech.dll is a core DLL in the .NET Framework class library 3.0 an...