DeepSpeech with Common Voice Training Data

saravananselvamohan · November 15, 2019, 6:46am

Does the DeepSpeech Model downloaded from the below link is trained with Common Voice Training Data or We need to train and extract the model separately

curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.5.1/deepspeech-0.5.1-models.tar.gz

dabinat · November 15, 2019, 3:46pm

The 0.5.1 model wasn’t trained on Common Voice data.

saravananselvamohan · November 18, 2019, 5:14am

Thanks for reply @dabinat. Whether we can get trained model on Common Voice Data from Online

lissyx · November 18, 2019, 10:01am

What is the question here ?

saravananselvamohan · November 18, 2019, 1:28pm

Whether we can get readymade model which is already trained on Common Voice Data

Mte90 · November 22, 2019, 6:46pm

You have to do it, like we did for Italian https://github.com/MozillaItalia/DeepSpeech-Italian-Model/ or for french https://github.com/Common-Voice/commonvoice-fr/

christian.colonna · November 30, 2019, 4:39pm

Hi, i’d like to train a ASR and a TTS model for Italian on Common Voice Training Data. I have few questions:

Is there already some model available readyToUse? I look at the github repo you link but there’s just the link to DeepSpeech “CodeTrainer” and a tool to manipulate datas. That is stuff just to train own’s model, right?
is this a good step to start with to understand your architecture? https://arxiv.org/abs/1412.5567
Sorry but there’s really a lot of infos on this site forum that i got lost. To train my own model have i to follow this guide TUTORIAL : How I trained a specific french model to control my robot ?
Does DeepSpeech works well only for ASR or also for TTS ? In case one can you suggest also an algorithm or paper or architecture for TTS?
Is some kind of preprocessing required?

Thank you in advance if you would like to help me.

Mte90 · December 2, 2019, 12:12pm

Hi Christian we have the italian category for that https://discourse.mozilla.org/c/voice/it