Length of audio file and text

Talha_Anwar · February 18, 2021, 3:58pm

I have checked the Arabic dataset upload and found that the deepspeech learn on the basis of sentences. I want to know instead of a single sentence in a single audio file can we feed multiple sentences in single audio file?

othiele · February 18, 2021, 4:22pm

Yes and no. Ideally you have shorter sequences of 5-10 seconds to train. You can try longer audio, but it can be problematic.

Talha_Anwar · February 18, 2021, 4:40pm

thanks. is there a way to automatically split audio as we split para into sentences.

Plato · February 18, 2021, 5:27pm

you can look at aeneas to help you split each audio into a seperate sentence. The tool alligns a given transcription to the timeframe it was said in a given audio. I’ve used it for english and i was happy with the results that i got. It seems there’s support for arabic (https://www.readbeyond.it/aeneas/docs/language.html) so you might want to try it out. goodluck!