I am completely new to ML and TTS, and I am trying to learn how to build a text-to-speech model. For now I have maybe 20-30 phrases, but I plan to train it further as I get more data. Is this feasible for a custom language? My current plan is:
- Creating an LJSpeech-style dataset with my 20-30 phrases
- Skipping the step of adding my custom alphabet, since I plan on using phonemes for training
- Writing a text cleaner
- Setting the training parameters (how do I figure out what values to use?)
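
To make the first and third steps concrete, here is a rough sketch of what I have in mind. It assumes the usual LJSpeech layout: a `wavs/` folder next to a pipe-delimited `metadata.csv` with three fields per line (clip ID, raw transcript, normalized transcript). The function names `clean_text` and `write_metadata` and the clip IDs are my own placeholders, not part of any TTS library, and the cleaning rules are only a guess at what a custom language might need.

```python
import re
import unicodedata
from pathlib import Path


def clean_text(text: str) -> str:
    """Normalize one transcript: Unicode NFC, lowercase, collapsed whitespace."""
    text = unicodedata.normalize("NFC", text)
    text = text.lower()
    # "|" is the field separator in metadata.csv, so it cannot stay in a transcript.
    text = text.replace("|", " ")
    text = re.sub(r"\s+", " ", text)
    return text.strip()


def write_metadata(phrases: dict[str, str], dataset_dir: Path) -> Path:
    """Write an LJSpeech-style metadata.csv: id|raw transcript|normalized transcript.

    `phrases` maps a clip ID (the wav file name without ".wav") to its transcript.
    The audio itself is expected at dataset_dir / "wavs" / f"{clip_id}.wav".
    """
    dataset_dir.mkdir(parents=True, exist_ok=True)
    (dataset_dir / "wavs").mkdir(exist_ok=True)
    lines = []
    for clip_id, raw in phrases.items():
        raw = raw.replace("|", " ").strip()  # keep the raw field pipe-free too
        lines.append(f"{clip_id}|{raw}|{clean_text(raw)}")
    metadata_path = dataset_dir / "metadata.csv"
    metadata_path.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return metadata_path
```

Calling `write_metadata({"phrase_0001": "Hello  World"}, Path("my_dataset"))` would produce one line per phrase in `my_dataset/metadata.csv`. If the model is trained on phonemes, the normalized field would presumably go through a phonemizer afterwards; that step is not shown here.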