What is the ideal decibel? Do we need to adjust volume of datasets?

kouohhashi · May 10, 2019, 7:33pm

Hi, I’m training a speech recognition model for Japanese with my own datasets.

Parts of my datasets have very low volume.
Should I increase the volume of such audio before training?
Would it be better to normalize the audio?

What average decibel level is ideal?
I mean, I may also have to decrease the volume when the audio is too loud.
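
To make the question concrete, here is a minimal sketch of what normalizing to an average level could look like, assuming the audio is already loaded as a NumPy float array with samples in [-1.0, 1.0]. The function names `rms_dbfs` and `normalize_rms` are hypothetical, and the default target of -20 dBFS is only a placeholder for illustration, not a level recommended anywhere in this thread.

```python
import numpy as np


def rms_dbfs(samples: np.ndarray) -> float:
    """Return the RMS level of float samples in dB relative to full scale."""
    x = samples.astype(np.float64)
    rms = np.sqrt(np.mean(x * x))
    if rms == 0.0:
        return float("-inf")  # digital silence has no finite level
    return float(20.0 * np.log10(rms))


def normalize_rms(samples: np.ndarray, target_dbfs: float = -20.0) -> np.ndarray:
    """Scale samples so their RMS level matches target_dbfs.

    target_dbfs is an assumed placeholder value, not a known ideal.
    """
    x = samples.astype(np.float64)
    current = rms_dbfs(x)
    if current == float("-inf"):
        return x  # nothing to scale in a silent clip
    # Gain in linear terms: positive dB difference boosts, negative attenuates.
    gain = 10.0 ** ((target_dbfs - current) / 20.0)
    scaled = x * gain
    # If boosting pushed peaks past full scale, back off to avoid clipping.
    peak = np.max(np.abs(scaled))
    if peak > 1.0:
        scaled = scaled / peak
    return scaled
```

The same gain calculation handles both cases in the question: quiet clips get a gain above 1, and clips that are too loud get a gain below 1.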

Thanks in advance.

kdavis · May 11, 2019, 5:25am

When the model is used in production, do you expect the volume of the audio to change significantly, or do you expect all the audio to be at approximately the same volume?

kouohhashi · May 11, 2019, 3:46pm

The volume should be approximately the same, though I’m not sure I can release it as a real product…

Thanks for your advice.
