How to create a data set to train the model?

I want to fine-tune the DeepSpeech model for my country's English accent. How can I prepare a data set for that, and how do I use it in the training environment?
I am using Google Colab as the training environment. I can't use the Common Voice data set or any other available data set, because the English accent in my country is a bit different.


https://medium.com/@klintcho/creating-an-open-speech-recognition-dataset-for-almost-any-language-c532fb2bc0cf
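
Once you have recorded clips, DeepSpeech's training script reads them from CSV files with the columns `wav_filename`, `wav_filesize` and `transcript`, and expects 16 kHz, mono, 16-bit WAV audio. Here is a rough sketch of building such a CSV. It assumes a layout that is my own invention, not something from the article: one folder with `clip.wav` and a matching `clip.txt` transcript per recording. The function name `build_manifest` is hypothetical, and the lowercasing assumes you keep the default English alphabet.

```python
import csv
import os
import wave

FIELDS = ["wav_filename", "wav_filesize", "transcript"]


def build_manifest(clips_dir, csv_path):
    """Write a DeepSpeech-style CSV for the usable clips in clips_dir.

    Assumed layout (hypothetical): every foo.wav has its transcript in foo.txt.
    Clips that are not 16 kHz / mono / 16-bit, or that have no transcript,
    are skipped rather than converted.
    """
    rows = []
    for name in sorted(os.listdir(clips_dir)):
        if not name.lower().endswith(".wav"):
            continue
        wav_path = os.path.abspath(os.path.join(clips_dir, name))
        txt_path = os.path.splitext(wav_path)[0] + ".txt"
        if not os.path.exists(txt_path):
            continue  # no transcript for this clip

        # Check the audio format from the WAV header.
        with wave.open(wav_path, "rb") as w:
            usable = (
                w.getframerate() == 16000
                and w.getnchannels() == 1
                and w.getsampwidth() == 2  # 2 bytes per sample = 16-bit
            )
        if not usable:
            continue

        # Normalise the transcript: lowercase, single spaces, no line breaks.
        with open(txt_path, encoding="utf-8") as f:
            transcript = " ".join(f.read().lower().split())
        if not transcript:
            continue

        rows.append(
            {
                "wav_filename": wav_path,
                "wav_filesize": os.path.getsize(wav_path),
                "transcript": transcript,
            }
        )

    with open(csv_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDS)
        writer.writeheader()
        writer.writerows(rows)
    return rows
```

You would run this once per split (train, dev, test) and pass the resulting files to the training script through `--train_files`, `--dev_files` and `--test_files`. On Colab, the clips folder can live on a mounted Drive; just make sure the paths in the CSV match where the files actually are at training time.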
