DeepSpeech silence detection

meghagowda5193 · June 19, 2018, 8:48am

Hi all,

I have audio files that are at least 7 to 10 minutes long. Since DeepSpeech can't handle audio of that length, I am splitting each file into 10-second chunks. However, my audio files don’t contain speech throughout. Is there a way to detect when no speech is being spoken? Right now I am doing it manually: I create the segments myself and cut out the silent parts of the audio.

thanks

lissyx · June 19, 2018, 10:36am

Voice Activity Detection is likely what you are looking for, e.g., https://pypi.org/project/webrtcvad/
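
To make the suggestion concrete, here is a minimal sketch of how VAD-based chunking could look. It assumes 16-bit mono PCM audio at 16 kHz, cut into 30 ms frames (webrtcvad accepts 10, 20 or 30 ms frames at 8, 16, 32 or 48 kHz). The helper names `split_frames` and `speech_segments` and the `max_gap_frames` parameter are made up for this illustration and are not part of webrtcvad or DeepSpeech. The per-frame speech test is passed in as a function, so any detector with the same call shape can be used.

```python
def split_frames(pcm, sample_rate, frame_ms=30):
    """Yield fixed-size frames of 16-bit mono PCM; a trailing partial frame is dropped."""
    frame_bytes = int(sample_rate * frame_ms / 1000) * 2  # 2 bytes per sample
    for start in range(0, len(pcm) - frame_bytes + 1, frame_bytes):
        yield pcm[start:start + frame_bytes]


def speech_segments(pcm, sample_rate, is_speech, frame_ms=30, max_gap_frames=10):
    """Return [(start_sec, end_sec), ...] for the regions flagged as speech.

    is_speech is any callable taking (frame_bytes, sample_rate) and returning
    a bool. Silent gaps of up to max_gap_frames frames are bridged so that
    short pauses between words do not split a segment (illustrative default).
    """
    segments = []
    start = None        # index of the first frame of the open segment
    last_voiced = None  # index of the most recent speech frame
    for i, frame in enumerate(split_frames(pcm, sample_rate, frame_ms)):
        if is_speech(frame, sample_rate):
            if start is None:
                start = i
            last_voiced = i
        elif start is not None and i - last_voiced > max_gap_frames:
            # The pause is long enough: close the segment at the last speech frame.
            segments.append((start * frame_ms / 1000,
                             (last_voiced + 1) * frame_ms / 1000))
            start = None
    if start is not None:
        # The audio ended while a segment was still open.
        segments.append((start * frame_ms / 1000,
                         (last_voiced + 1) * frame_ms / 1000))
    return segments


# Possible use with webrtcvad (third-party: pip install webrtcvad):
#   import webrtcvad
#   vad = webrtcvad.Vad(2)
#   segments = speech_segments(pcm, 16000, vad.is_speech)
```

Each returned (start, end) pair could then be cut out of the original file and passed to DeepSpeech on its own, so the silent stretches are never transcribed. Segments longer than the 10-second chunk size would still need to be split further.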

elpimous_robot · August 24, 2018, 10:05pm

Hi. I can confirm: webrtcvad is very powerful and works perfectly!
You can also select the detection level…
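
The detection level presumably refers to webrtcvad's aggressiveness mode, an integer from 0 to 3, where 0 is the least aggressive about filtering out non-speech and 3 is the most aggressive. The sketch below shows one way to set it and to compare levels on the same audio. The helpers `make_detector` and `speech_fraction` are hypothetical names for this example, not part of webrtcvad; only `webrtcvad.Vad(mode)` and `Vad.is_speech(frame, sample_rate)` come from the library.

```python
def make_detector(aggressiveness=2):
    """Return a (frame, sample_rate) -> bool speech test backed by webrtcvad.

    aggressiveness is webrtcvad's mode: 0 (least aggressive about filtering
    out non-speech) to 3 (most aggressive).
    """
    if aggressiveness not in (0, 1, 2, 3):
        raise ValueError("webrtcvad aggressiveness must be 0, 1, 2 or 3")
    import webrtcvad  # third-party: pip install webrtcvad
    vad = webrtcvad.Vad(aggressiveness)
    return vad.is_speech


def speech_fraction(frames, sample_rate, is_speech):
    """Fraction of frames flagged as speech; handy for comparing levels."""
    frames = list(frames)
    if not frames:
        return 0.0
    voiced = sum(1 for frame in frames if is_speech(frame, sample_rate))
    return voiced / len(frames)
```

Running `speech_fraction(frames, 16000, make_detector(level))` for each level from 0 to 3 on the same list of 10, 20 or 30 ms frames gives a rough feel for how strict each setting is. On noisy recordings a higher level will typically flag fewer frames as speech.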