How to implement voice recognition in .NET?
I've got a bunch of .mp3
s with computer generated voice of relatively high quality. In each .mp3
the voice says some random text and then a sequence of 6-7 digits in English. I'd like to extract those digits as text.
Is it possible to implement such a thing in .NET?