Hi, i’d like to train a ASR and a TTS model for Italian on Common Voice Training Data. I have few questions:
- Is there already some model available readyToUse? I look at the github repo you link but there’s just the link to DeepSpeech “CodeTrainer” and a tool to manipulate datas. That is stuff just to train own’s model, right?
- is this a good step to start with to understand your architecture? https://arxiv.org/abs/1412.5567
- Sorry but there’s really a lot of infos on this site forum that i got lost. To train my own model have i to follow this guide TUTORIAL : How I trained a specific french model to control my robot ?
- Does DeepSpeech works well only for ASR or also for TTS ? In case one can you suggest also an algorithm or paper or architecture for TTS?
- Is some kind of preprocessing required?
Thank you in advance if you would like to help me.