Train DeepSpeech with Indian English

bhadwal.abhishek · October 17, 2018, 10:12am

Hi,
I have been working with Deep Speech 2 to convert speech to text. The data I used is Indian English.

But the accuracy is not good at all: it doesn't pick the right words, although the output sounds or looks somewhat similar.
My question is: say I have enough Indian English data and I want to train on it.
Will I need to make any other changes, for example in the code, the vocab file, or the language model? Or will training alone give me good results?

Thanks in advance.

muruganrajenthirean · October 18, 2018, 9:24am

Sir, you have to fine-tune the DeepSpeech v0.2.0 pretrained model with your own audio files (data augmentation recommended).

I have already faced the same problem. Now my fine-tuned model is working fine. Do this, sir.

lissyx · October 18, 2018, 9:26am

For context, using a French dataset produced from audiobooks on top of the English model did provide interesting results with around 100 hours of data.

@bhadwal.abhishek Can you provide more context on your case? Amount and source of data, any pre-processing happening, your language model, etc.

bhadwal.abhishek · October 22, 2018, 10:00am

Thanks buddy. But what was the source of your data? I mean, where did you take the data from, and what was the size of your dataset?

bhadwal.abhishek · October 22, 2018, 10:02am

Hi buddy,
I still have to collect the data.
Can I use the same language model that was used for the Deep Speech 2 pretrained model?

muruganrajenthirean · October 22, 2018, 10:02am

I recorded 1500 audio clips of my own and augmented them with different pitch, distance, etc. to get 20000 more audio files.
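
A minimal sketch of the kind of augmentation described above (pitch, distance), not the poster's actual tool. It uses only the Python standard library and works on a list of float samples in the range -1.0 to 1.0. The function names, the perturbation values, and the idea of approximating "distance" with a simple gain reduction are all assumptions made for illustration. A real pipeline would more likely use an audio library and read/write WAV files.

```python
import math
import random


def change_speed(samples, factor):
    """Resample by linear interpolation.

    factor > 1 shortens the clip and raises the pitch; factor < 1 does the
    opposite. Pitch and tempo change together, this is not a pure pitch shift.
    """
    n_out = max(1, int(round(len(samples) / factor)))
    if n_out == 1 or len(samples) < 2:
        return list(samples[:1])
    step = (len(samples) - 1) / (n_out - 1)
    out = []
    for i in range(n_out):
        pos = i * step
        lo = min(int(pos), len(samples) - 2)
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[lo + 1] * frac)
    return out


def change_gain(samples, gain_db):
    """Scale the amplitude; a negative dB value crudely imitates a more distant speaker."""
    scale = 10.0 ** (gain_db / 20.0)
    return [s * scale for s in samples]


def add_noise(samples, snr_db, rng):
    """Add white Gaussian noise at the requested signal-to-noise ratio."""
    signal_power = sum(s * s for s in samples) / len(samples)
    noise_sigma = math.sqrt(signal_power / (10.0 ** (snr_db / 10.0)))
    return [s + rng.gauss(0.0, noise_sigma) for s in samples]


def augment(samples, rng):
    """Return several perturbed copies of one clip, clipped to [-1, 1]."""
    variants = [change_speed(samples, f) for f in (0.9, 1.1)]
    variants += [change_gain(samples, g) for g in (-6.0, -12.0)]
    variants.append(add_noise(samples, 20.0, rng))
    return [[max(-1.0, min(1.0, s)) for s in v] for v in variants]


if __name__ == "__main__":
    # One second of a 440 Hz tone at 16 kHz stands in for a recorded clip.
    tone = [0.5 * math.sin(2 * math.pi * 440.0 * n / 16000) for n in range(16000)]
    for variant in augment(tone, random.Random(0)):
        print(len(variant))
```

Each augmented copy keeps the transcript of the clip it came from, so every original recording yields several extra training examples.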

bhadwal.abhishek · October 22, 2018, 10:03am

Okay,
But other than that, did you do anything else? I mean, did you create your own language model or make any changes anywhere in DeepSpeech?

muruganrajenthirean · October 22, 2018, 10:04am

Yes, I am using the DeepSpeech v0.2.0 pretrained model, sir, and I am fine-tuning it with my audio files.

bhadwal.abhishek · October 22, 2018, 10:04am

And how much time did it take, and what hardware did you use?

bhadwal.abhishek · October 22, 2018, 10:05am

Are you done with it?
What accuracy did you get?

muruganrajenthirean · October 22, 2018, 10:06am

A 4 GB GTX 1500 series GPU. It takes 35 hours of continuous training, sir.

Are you done with it?
What accuracy did you get?

Yes, I built this model with my own audio; it gives good results for the Indian English accent.

bhadwal.abhishek · October 22, 2018, 10:08am

And when you said you did some fine-tuning…
does it mean just adding the new dataset on top of the already pretrained model, or something else as well?

muruganrajenthirean · October 22, 2018, 10:10am

just adding the new dataset on top of the already pretrained model

Yes: pretrained model + own audio files (fine-tuning) generates a new model.
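
As a rough sketch of what "pretrained + own audio files" looks like in practice, the snippet below only assembles a command line for `DeepSpeech.py`; it does not start training. The flag names follow the "continuing training from a release model" example in the DeepSpeech README for the 0.4.x releases, so treat them as an assumption and check them against the release you have checked out. The helper name `build_finetune_command`, the paths, and the default values are hypothetical.

```python
import shlex


def build_finetune_command(checkpoint_dir, train_csv, dev_csv, test_csv,
                           extra_epochs=3, learning_rate=0.0001, n_hidden=2048):
    """Assemble (but do not run) a fine-tuning command line.

    Assumptions: flag names as in the 0.4.x README; n_hidden must match the
    released checkpoint; a negative --epoch is read there as "this many more
    epochs on top of the loaded checkpoint".
    """
    return [
        "python3", "DeepSpeech.py",
        "--n_hidden", str(n_hidden),
        "--checkpoint_dir", checkpoint_dir,
        "--epoch", str(-extra_epochs),
        "--train_files", train_csv,
        "--dev_files", dev_csv,
        "--test_files", test_csv,
        "--learning_rate", str(learning_rate),
    ]


if __name__ == "__main__":
    # Hypothetical paths, shown only to illustrate the shape of the command.
    cmd = build_finetune_command(
        "path/to/checkpoint/folder", "my-train.csv", "my-dev.csv", "my-test.csv"
    )
    print(" ".join(shlex.quote(part) for part in cmd))
```

A small learning rate is the usual choice here so that the new recordings adjust the pretrained weights rather than overwrite them.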

bhadwal.abhishek · October 22, 2018, 10:11am

Thanks a lot buddy, you are really a life saver…
Please let me know if you do social service and donate the dataset : )

muruganrajenthirean · October 22, 2018, 10:15am

My dataset was recorded for one particular purpose only, so it does not cover everything. That is one disadvantage for us. If you want more data, try recording different kinds of speakers and do data augmentation, sir.

I have code for a data augmentation UI. It makes the augmentation process very easy to handle.

If you want data, please do this. It will help you a lot.

come for ready to help page.

bhadwal.abhishek · October 22, 2018, 10:19am

Okay…
Thanks a ton, brother.

muruganrajenthirean · October 22, 2018, 10:19am

bhadwal.abhishek · October 23, 2018, 3:58am

Hey buddy… I was wondering if you could share your trained model,
as training requires a lot of hardware and will take time.
I just wanted to see how it works,
if that is okay with you.

Thanks

Sudarshan.gurav14 · April 17, 2019, 12:44pm

@bhadwal.abhishek Hello sir, how do we train on our own dataset? Can you please help me?
I am a fresher and I want to learn how to retrain the DeepSpeech model for an Indian accent. @lissyx When I try that, I get an error: key_layer/_1 bais not found in checkpoint 0.4.1. I am continuing the training from the release model.

lissyx · April 17, 2019, 1:22pm

Please continue training using the proper git branch, v0.4.1, and not master.
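
On the "how to train on our dataset" part of the question: DeepSpeech reads its training, dev and test sets from CSV files with the columns `wav_filename`, `wav_filesize` and `transcript`. The sketch below writes such a file from a list of (WAV path, transcript) pairs. The helper name `write_manifest` is hypothetical, and lowercasing the transcript is an assumption based on the lowercase English alphabet shipped with the release models; adjust it if your alphabet file differs.

```python
import csv
import os


def write_manifest(pairs, csv_path):
    """Write a DeepSpeech-style CSV from (wav_path, transcript) pairs.

    Returns the number of data rows written. Transcripts are stripped and
    lowercased (an assumption, see the note above).
    """
    count = 0
    with open(csv_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["wav_filename", "wav_filesize", "transcript"])
        for wav_path, transcript in pairs:
            writer.writerow([
                os.path.abspath(wav_path),
                os.path.getsize(wav_path),  # file size in bytes
                transcript.strip().lower(),
            ])
            count += 1
    return count
```

Called as `write_manifest([("clips/a.wav", "hello world")], "my-train.csv")` (hypothetical paths), it produces a file that can be passed to the training script as one of the train, dev or test sets.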
