Could you please give me information about training Mozilla TTS with my own dataset? Do I need only the audio recordings and their transcripts? I am a complete newbie to TTS.
I prepared a dataset from audiobooks. Like LJSpeech, I have 16-bit PCM mono .wav files. Do I need anything else? Could you tell me the steps I should follow?
I checked the FAQ but couldn't completely understand the steps for preparing my own dataset and training a model with it. Could you please give me the exact steps?
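
In case it helps anyone answering, here is a minimal sketch of how the audio format mentioned above (16-bit PCM, mono) could be verified with Python's standard `wave` module. The function names and the folder passed to `find_mismatched_clips` are hypothetical, chosen only for illustration; this is not part of Mozilla TTS.

```python
import wave
from pathlib import Path


def is_16bit_pcm_mono(path):
    """Return True if the WAV file at `path` is mono with 16-bit samples."""
    try:
        with wave.open(str(path), "rb") as wav:
            # Sample width is reported in bytes, so 2 means 16-bit.
            return wav.getnchannels() == 1 and wav.getsampwidth() == 2
    except (wave.Error, EOFError):
        # The standard-library reader rejects non-PCM or malformed files.
        return False


def find_mismatched_clips(wav_dir):
    """Return the sorted names of .wav files in `wav_dir` that fail the check."""
    return sorted(
        p.name for p in Path(wav_dir).glob("*.wav") if not is_16bit_pcm_mono(p)
    )
```

Calling `find_mismatched_clips("wavs")` on a hypothetical `wavs` folder would list any clips that need converting before training; an empty list means every file matches the format described.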