Audio files for Deepspeech

Testdeepv · June 24, 2019, 2:58pm

Can we train Deepspeech on audio files longer than 10 seconds (and less than one minute) ?

reuben · June 24, 2019, 3:03pm

Theoretically you can, but it may be trickier to get things working well. Longer audio files will mean lower batch sizes, which can affect convergence, and TensorFlow has some reported numerical instability issues with CTC and long sentences (https://github.com/tensorflow/tensorflow/issues/4193), although I don’t know if files shorter than one minute will trigger that. So, it may be possible, but it may take experimentation to get it there.

Topic		Replies	Views
Can i train the model with longer audio files? DeepSpeech	1	1146	February 9, 2018
Can DeepSpeech process longer audio files? DeepSpeech	5	6425	December 18, 2019
Transcribing longer audio files DeepSpeech	17	2685	February 28, 2023
Longer audio files with Deep Speech DeepSpeech	12	12076	November 21, 2019
Is DeepSpeech not meant for one word audio files? DeepSpeech	27	1493	July 30, 2020

Audio files for Deepspeech

Related topics