ansaurus

Question

Interpretation of DirectSound buffer elements from mic capture device

Answer 1

A:

From here, floating point PCM values are from [-1, 1].

MSN 2009-03-04 22:08:30

Answer 2

+2 A:

As MSN said the samples are in 32-bit floats. To detect a silence you would normally calculate the RMS value: Take the average of the squared sample values over some time interval (say 20-50 ms) and compare (square root of) this average to a threshold. The noise inherent in the microphone signal may let single samples reach above the threshold while the ambient sound would still be considered silence. The averaging over a short interval will result in a value that corresponds better with our perception.

Han 2009-03-05 08:22:34

Answer 3

A:

In addition to Han's suggestion to average samples, als consider calibrating your threshold value. Under different environments, with different microphones and different audio channels, "silence" can mean a lot of things.

The simple way would be loowing to configure the threshold. Alternatively, allow a "Noise floor measurement" where you acqurie a threshold value.

Note that the samples are linear, but levels in audio processing are usually given in dB. So depending on yoru target audience, you may want to convert readings and inputs to/from dB.

peterchen 2009-03-05 09:28:13

ansaurus

tags:

views:

answers:

Interpretation of DirectSound buffer elements from mic capture device

related questions