Hello everybody
Do the audio files in each language of the Mozilla Common Voice dataset have the same sample rate? And what is that value?
Old version: They are all 48 kHz.
New: Actually, we recently found out that it was changed to 32 kHz in 2020, to fix some server problems.
Thanks
How can I lower this sample rate to 16 kHz?
You can lower it easily with offline tools like ffmpeg or any other sound processing utility.
The dataset's sample rate is fixed [old: 48 kHz] and cannot be changed. NEW: In 2020, it was changed to 32 kHz for new recordings.
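48 kHz is the highest sampling rate commonly available on consumer devices. You can downsample from it without much information loss. If the source were, e.g., 16 kHz, upsampling would not help: by the Nyquist theorem, a 16 kHz recording only contains frequencies up to 8 kHz, and the missing content cannot be restored.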
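To put numbers on the Nyquist point, here is a small illustrative sketch (the function names are mine). It prints the highest representable frequency for each sample rate mentioned in this thread, and shows where a tone above that limit would land if it were sampled without low-pass filtering first. Resamplers such as ffmpeg and sox apply that filtering for you when downsampling.

```python
def nyquist_hz(sample_rate_hz):
    """Highest frequency a given sample rate can represent."""
    return sample_rate_hz / 2

def alias_hz(tone_hz, sample_rate_hz):
    """Apparent frequency of a pure tone sampled with no anti-aliasing filter."""
    nearest_multiple = round(tone_hz / sample_rate_hz) * sample_rate_hz
    return abs(tone_hz - nearest_multiple)

for rate in (48000, 32000, 16000):
    print(rate, "Hz sample rate -> content up to", nyquist_hz(rate), "Hz")

# A 10 kHz tone is above the 8 kHz limit of 16 kHz audio, so it would fold back.
print("10 kHz tone at 16 kHz appears at", alias_hz(10000, 16000), "Hz")
```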
BTW, most voice AI libraries work on WAV files, while the CV datasets contain MP3 files. So the clips have to be preprocessed (converted) anyway, and that conversion step is also the best place to downsample…
For example, in Coqui STT (the new DeepSpeech), it is done here, using sox:
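To be clear, what follows is not the Coqui STT code, just a minimal sketch of the same idea: convert every MP3 clip in a folder to 16 kHz mono WAV by calling the sox command-line tool. The function names and folder names are hypothetical, and it assumes your sox build can read MP3 files (some builds need an extra MP3 format package).

```python
import subprocess
from pathlib import Path

def wav_path_for(mp3_path, out_dir):
    """Map clips/foo.mp3 to out_dir/foo.wav."""
    return Path(out_dir) / (Path(mp3_path).stem + ".wav")

def build_sox_cmd(src, dst, sample_rate=16000, channels=1):
    """sox applies -r (rate) and -c (channels) to the output file that follows them."""
    return ["sox", str(src), "-r", str(sample_rate), "-c", str(channels), str(dst)]

def convert_clips(clips_dir, out_dir):
    """Convert every *.mp3 in clips_dir; requires the sox binary on PATH."""
    Path(out_dir).mkdir(parents=True, exist_ok=True)
    for mp3 in sorted(Path(clips_dir).glob("*.mp3")):
        subprocess.run(build_sox_cmd(mp3, wav_path_for(mp3, out_dir)), check=True)
```

Something like `convert_clips("clips", "clips_16k")` would then process a whole folder in one go.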
Thank you so much …
Do you know how to do this in MATLAB?
Thank you
It’s a good reference.