I am new comer to DS. For example, I record some of my voices, and let DeepSpeech learn from my words. After that, the DeepSpeech can speak the words out in a tone like me.
I don’t know the difference between “Festival” or “Espeak”. How should I start?
[FYI]
I found a site: resemble.ai , they provides voice clone. What framework do they use to make clone voices?