Training speech recognition software | ansaurus

tags:

speech-recognition

views:

54

answers:

1

Q:

Training speech recognition software

A little left field, but I'm trying to train a speech recognition program and the guidelines suggest that I attempt to speak clearly but naturally. I notice, however, that when one speaks naturally each word tends to drift into the next, resulting in a rather ambiguous boundary between the words.

One the one hand, speaking in a more stilted manner would seem to aid the computer in recognising the phonemes, but on the other it would tend to make it less likely to understand more natural speech.

Anyone knowledgeable in the field out there who can suggest which of the two approaches is more effective?

Thanks

+1 A:

Continuous-speech recognition is a different and more difficult problem than "discrete dictation" (the problem an IBM Research member of which I was a very junior member cracked about a quarter century ago;-). If "discrete" speech is acceptable for the given application, it's sure to give you higher recognition rates (will never confuse "recognize speech" with "wreck a nice beach";-). If it's absolutely not acceptable, however, then you should not use it (by definition of "absolutely" and "not acceptable";-).

Alex Martelli 2010-05-08 02:55:24

Interesting article: http://robertfortner.posterous.com/the-unrecognized-death-of-speech-recognition

TrueWill 2010-05-08 03:59:29

François 2010-07-29 23:17:50

related questions

C# Speech Recognition VISTA Problem

where to get SAPI ?

How do you efficiently create a grammar file for speech recognition given a large list of words?

Microsoft Speech Recognizer 6.1 Training Files

C# and SAPI, I have speech recognition but its picking up words im not interested in. How can I limit, not just over weight, the gramer dict?

Java voice recognition

How to split male and female voices from an audio file(in c++ or java)

CMU Sphinx Live Decoder

Speech to text conversion in Linux

Can I use the Vista speech API in Windows Server 2003?

Spoken Word Programming Language / System

Can I write SQL using speech recognition?

How to add words to an already loaded grammar using System.Speech and SAPI 5.3

Acoustic training using SAPI 5.3 Speech API

Vista Speech Recognition in Delphi

Question SpeechSynthesizer.SetOutputToAudioStream audio format problem

C# Speech Recognition - Is this what the user said?

What are the techniques for word recognition in a sound stream?

Voice Recognition Software For Developers

Speech Recognition for Searching Files

Vista speech recognition in multiple languages

What's a good open source VoiceXML implementation?

Anyone have experience with Sphinx speech recognition?

How do I search content, within audio files/streams?

How to get started with speech-to-text?