DeepSpeech with Common Voice Training Data

Hi, i’d like to train a ASR and a TTS model for Italian on Common Voice Training Data. I have few questions:

  1. Is there already some model available readyToUse? I look at the github repo you link but there’s just the link to DeepSpeech “CodeTrainer” and a tool to manipulate datas. That is stuff just to train own’s model, right?
  2. is this a good step to start with to understand your architecture? https://arxiv.org/abs/1412.5567
  3. Sorry but there’s really a lot of infos on this site forum that i got lost. To train my own model have i to follow this guide TUTORIAL : How I trained a specific french model to control my robot ?
  4. Does DeepSpeech works well only for ASR or also for TTS ? In case one can you suggest also an algorithm or paper or architecture for TTS?
  5. Is some kind of preprocessing required?

Thank you in advance if you would like to help me.