ASR and TTS for Italian Model with Common Voice data

christian.colonna · November 30, 2019, 4:48pm

Hi,
i would like to train an ASR and a TTS model for italian based on Common Voice Dataset.
I have some questions.

DeepSpeech can only train ASR is correct?
The procedure to follow to train own’s model is the one pointed out in TUTORIAL : How I trained a specific french model to control my robot ? Have i to follow this one?
in positive case, i would like to apply my model to domotic, so is there any kind of pre-processing or sound properties or other stuff i need to know to properly train the model ? Can i find anything i need in the paper https://arxiv.org/abs/1412.5567 ? Or can you suggest me other references?
Can someone give an advice on a good architecture to train TTS ?

thank you a lot!
Christian

lissyx · November 30, 2019, 9:48pm

You should take contact with @Mte90

Please follow the official documentation, this tutorial is good but it’s old and for a specific case. https://github.com/mozilla/DeepSpeech/blob/master/TRAINING.rst

Likely you can have a look at what @erogol does

Mte90 · December 2, 2019, 12:13pm

Hi Christian, we already have the model for italian if you check on https://discourse.mozilla.org/c/voice/it you can find the italian category.

Also if you have telegram our community is there and discussing how to improve it, check @mozitabot and pick the developers group.

Topic		Replies	Views
DeepSpeech with Common Voice Training Data DeepSpeech	7	2540	December 2, 2019
Some Beginner Questions DeepSpeech	1	311	March 29, 2021
Using Common Voice data with DeepSpeech Common Voice	11	7543	August 21, 2021
Common Voice Training DeepSpeech	2	364	June 24, 2021
Preprocesses steps of Common Voice dataset DeepSpeech	1	313	May 8, 2021