Hello everybody,

I am working on text-to-speech, transforming text into mp3 audio files, using Python 2.5.

I use pyTTS as the Python text-to-speech module to transform text into audio .wav files (pyTTS cannot encode to mp3 directly). After that, I encode these wav files to mp3 with the lame command-line encoder.
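For reference, the two-step pipeline looks roughly like this (simplified sketch; the file names are just placeholders):

    # simplified sketch of my current two-step pipeline
    import pyTTS
    import subprocess

    tts = pyTTS.Create()                                    # SAPI engine via pyTTS
    tts.SpeakToWave("speech.wav", "some text to speak")     # text -> .wav
    subprocess.call(["lame", "speech.wav", "speech.mp3"])   # .wav -> .mp3 with lame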

Now, the problem: I would like to insert, at a particular point of an mp3 audio file (between two words), an external sound file (like a warning sound) or, if possible, a generated warning sound.

My questions are:

1) I have seen that pyTTS can save the audio stream either to a file or to a memory stream, using two functions:

tts.SpeakToWave(file, text) or tts.SpeakToMemory(text)

Using the tts.SpeakToMemory(text) function together with PyMedia, I have been able to save an mp3 directly, but the mp3 file (when played back) sounds incomprehensible, like Donald Duck! :-) Here is a snippet of the code:

    import pymedia.audio.acodec as acodec

    # tts is the pyTTS engine and p.Text the text to speak (created elsewhere)
    params = {'id': acodec.getCodecID('mp3'), 'bitrate': 128000,
              'sample_rate': 44100, 'ext': 'mp3', 'channels': 2}

    m = tts.SpeakToMemory(p.Text)
    soundBytes = m.GetData()

    enc = acodec.Encoder(params)
    frames = enc.encode(soundBytes)

    f = open("test.mp3", 'wb')
    for frame in frames:
        f.write(frame)
    f.close()

I cannot understand where the problem is. If this approach worked correctly, it would let me skip the wav file transformation step.

2) As a second problem, I need to concatenate the mp3 audio file (obtained from the text-to-speech module) with a particular warning sound.

Obviously, it would be great if I could concatenate the audio memory stream of the text (after the text-to-speech step) with the stream of a warning sound, before encoding the whole memory stream into a single mp3 file.

I have also seen that the tkSnack library can concatenate audio, but it cannot write mp3 files.

I hope I have been clear. :-)

Many thanks for your answers to my questions.

Giulio

+1  A: 

I don't think PyTTS produces default PCM data (i.e. 44100 Hz, stereo, 16-bit). You should check the format like this:

memStream = tts.SpeakToMemory("some text")
format = memStream.Format.GetWaveFormatEx()

...and hand it over correctly to acodec. For that, you can use the attributes format.Channels, format.BitsPerSample and format.SamplesPerSec.
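Something along these lines should do it (untested sketch, reusing the params dictionary from your snippet; I'm assuming the engine delivers 16-bit samples, which is what the encoder expects here):

    # untested sketch: build the acodec params from the actual TTS output format
    import pymedia.audio.acodec as acodec

    memStream = tts.SpeakToMemory(p.Text)
    fmt = memStream.Format.GetWaveFormatEx()

    params = {'id': acodec.getCodecID('mp3'),
              'bitrate': 128000,
              'sample_rate': fmt.SamplesPerSec,   # the engine's real sample rate
              'channels': fmt.Channels,           # and its real channel count
              'ext': 'mp3'}
    # also check fmt.BitsPerSample: the raw data fed to the encoder
    # should be 16-bit PCM

    enc = acodec.Encoder(params)
    f = open("test.mp3", 'wb')
    for frame in enc.encode(memStream.GetData()):
        f.write(frame)
    f.close()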

As to your second question, if the sounds are in the same format, you should be able to simply pass them all to enc.encode, one after another.
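A minimal sketch of that idea (speechBytes and warningBytes are placeholder names for raw PCM buffers in the same format):

    # untested sketch: feed several raw buffers to the same encoder in sequence
    enc = acodec.Encoder(params)
    out = open("combined.mp3", 'wb')
    for raw in (speechBytes, warningBytes):   # placeholder buffer names
        for frame in enc.encode(raw):
            out.write(frame)
    out.close()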

AndiDog
A: 

Hi - I can't provide a definitive answer here, sorry, but here are some things to try. I'd look at the documentation of the pymedia module to check whether there are any quality settings you can configure.

The other thing is that, unlike wave or raw audio, you won't be able to simply concatenate mp3-encoded audio: whatever solution you reach, you will have to concatenate/mix your sounds while they are uncompressed (unencoded), and only afterwards generate the mp3-encoded audio.
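For example, with .wav files you could do the concatenation with the standard wave module and encode the result once (sketch; it assumes all inputs share the same sample rate, channel count and sample width, and the file names are placeholders):

    # sketch: join uncompressed .wav files, then encode once with lame
    import wave
    import subprocess

    inputs = ["speech_part1.wav", "warning.wav", "speech_part2.wav"]

    out = wave.open("combined.wav", 'wb')
    for i, name in enumerate(inputs):
        w = wave.open(name, 'rb')
        if i == 0:
            out.setparams(w.getparams())      # copy the format of the first file
        out.writeframes(w.readframes(w.getnframes()))
        w.close()
    out.close()

    subprocess.call(["lame", "combined.wav", "combined.mp3"])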

Also, sometimes we just have the feeling that recording a file to disk and reconverting it, instead of doing it in "one step", is awkward - while in practice, the software often does exactly that behind the scenes, even if we don't specify a file ourselves. If you are on a Unix-like system, you can always create a FIFO special file (with the mkfifo command) and send your .wav data there for encoding in a separate process (using lame): to your programs it will look like you are using an intermediate file, but you actually won't be.
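A rough sketch of that idea (Unix only; the path is hypothetical, and whether SpeakToWave and lame are happy with a FIFO instead of a regular file is something you would have to try):

    # sketch: let lame read from a named pipe while the wav data is written to it
    import os
    import subprocess

    fifo = "/tmp/tts_pipe.wav"                                # hypothetical path
    os.mkfifo(fifo)
    encoder = subprocess.Popen(["lame", fifo, "output.mp3"])  # reader side

    tts.SpeakToWave(fifo, "some text")                        # writer side
    encoder.wait()
    os.remove(fifo)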

jsbueno