I am completely new to ML and TTS, and I am trying to learn how to build a text-to-speech model. For now I have maybe 20-30 phrases, but I plan to train it further as I collect more data. Is this feasible for a custom language? My current steps are:
creating an LJSpeech-style dataset with my 20-30 phrases
I am going to skip adding my custom alphabet, since I plan on using phonemes for training
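
For the dataset step, here is a rough sketch of what I have in mind. It assumes the usual LJSpeech layout: a `wavs/` folder with one `<id>.wav` per clip, plus a header-less, pipe-delimited `metadata.csv` where each row is `id|raw text|normalized text`. The function name, clip IDs, and phrases are placeholders I made up for illustration, not part of any library.

```python
from pathlib import Path


def write_ljspeech_metadata(dataset_dir, phrases):
    """Write an LJSpeech-style metadata.csv (hypothetical helper).

    phrases: iterable of (clip_id, raw_text, normalized_text) tuples.
    Each clip is expected to be recorded as <dataset_dir>/wavs/<clip_id>.wav.
    """
    root = Path(dataset_dir)
    # LJSpeech keeps all audio clips in a "wavs" subfolder.
    (root / "wavs").mkdir(parents=True, exist_ok=True)

    metadata_path = root / "metadata.csv"
    with metadata_path.open("w", encoding="utf-8", newline="") as f:
        for clip_id, raw_text, normalized_text in phrases:
            # The pipe is the field separator, so it must not appear in a field.
            if any("|" in field for field in (clip_id, raw_text, normalized_text)):
                raise ValueError(f"'|' is not allowed in fields for {clip_id}")
            f.write(f"{clip_id}|{raw_text}|{normalized_text}\n")
    return metadata_path


if __name__ == "__main__":
    # Placeholder phrases; with phonemes, the text columns would hold
    # the phoneme strings instead of plain spelling.
    example = [
        ("clip_0001", "first phrase", "first phrase"),
        ("clip_0002", "second phrase", "second phrase"),
    ]
    print(write_ljspeech_metadata("my_dataset", example))
```

New phrases could be added later by re-running this with a longer list, as long as each ID has a matching wav file.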