Preparing data for finetuning

swarajbadhei · June 20, 2020, 7:03am

I have Indian-accent English audio data and the corresponding transcripts. I want to use them to fine-tune the pre-trained model, but I don’t know how to prepare the data for DeepSpeech (.tsv files). Please help.

othiele · June 20, 2020, 9:12am

Please do some research before asking for help:

https://discourse.mozilla.org/t/what-and-how-to-report-if-you-need-support/62071/2

swarajbadhei · June 20, 2020, 9:45am

My apologies for the silliness sir.
My configuration is as follows.
DeepSpeech version : 0.7.1
OS : Ubuntu 18.04
Python Version : Python 3.6.9
TensorFlow version : 1.14.0

The issue is that I have collected a dataset of Indian-accent English audio files with corresponding transcripts. Since fine-tuning requires input files in a specific format, I need to know how to prepare them from the data I have.
Thank you

othiele · June 20, 2020, 12:19pm

Please use the search function and read the documentation. You need to prepare the data like all data for DeepSpeech.

lissyx · June 22, 2020, 10:34am

Your TensorFlow version (1.14.0) is also not aligned with the docs. If you are working with DeepSpeech 0.7, you should have TensorFlow r1.15.

As @othiele said, please refer to the docs. There is no special file format for fine-tuning; you need to pass a set of CSVs, as for any training. You can look at the (numerous) importers for examples.
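
For illustration, here is a minimal sketch of building such a CSV by hand. It assumes the usual DeepSpeech training layout with the columns `wav_filename`, `wav_filesize` and `transcript`, and that transcripts should be reduced to lowercase a-z, apostrophe and space to match the English alphabet of the pre-trained model. The helper names `normalize` and `build_csv` are hypothetical, not part of DeepSpeech, and the sketch does not check the audio format (the released English model is usually described as expecting 16 kHz mono WAV), so verify all of this against the docs and the importers for your version.

```python
import csv
import os
import re


def normalize(text):
    """Lowercase and keep only a-z, apostrophe and space (assumed alphabet)."""
    text = re.sub(r"[^a-z' ]+", " ", text.lower())
    return " ".join(text.split())  # collapse repeated whitespace


def build_csv(pairs, csv_path):
    """Write (wav_path, transcript) pairs as a DeepSpeech-style training CSV."""
    with open(csv_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["wav_filename", "wav_filesize", "transcript"])
        for wav_path, transcript in pairs:
            text = normalize(transcript)
            if not text:
                continue  # skip clips whose transcript is empty after cleaning
            writer.writerow(
                [os.path.abspath(wav_path), os.path.getsize(wav_path), text]
            )
```

Calling `build_csv` once each for your train, dev and test splits gives three files that can then be passed to the training script through its `--train_files`, `--dev_files` and `--test_files` flags.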

Kunal_Jain · August 11, 2020, 6:09pm

Hi Swaraj. Can you share your experience with fine-tuning DeepSpeech for Indian-accent English? Can you share your results?

swarajbadhei · August 12, 2020, 8:46am

Hello Kunal,
Thanks for your interest. My work is not complete yet. Once I have presentable results, I will surely get back to you.
Happy Coding