Hello,
I try to train my own TTS model, but I am already in 95800 step and I can’t understand a single word from the output. I am lost, and I don’t know what I can do anymore. Any ideas what I did wrong?
Dataset: https://drive.google.com/drive/folders/13CSvJhH68C7BqepyOdRts9w0IY5hqILw?usp=sharing
Model: https://drive.google.com/drive/folders/1V9ILmK31SV8vnN-Et2y1BW6lb8njlvNS?usp=sharing
Any help is appreciated, thanks.
erogol
(Egolge)
February 17, 2021, 1:27pm
2
Can you share a Colab to try the model?
Do you plan to release the model open source?
Is this an open source dataset?
Maybe we can work together on that to make Czech available on the TTS.
I don’t plan on making this one open source because I’m doing it for friend, but I would definitely collaborate on an open source czech model.
erogol
(Egolge)
February 17, 2021, 2:10pm
4
do you know any open dataset ?
Yes, but it is under CC-0
erogol
(Egolge)
February 17, 2021, 2:13pm
6
but does it have a enough size single speaker subset?
mrthorstenm
(Thorsten Mueller)
February 18, 2021, 11:20am
8
It’s hard to tell a number, because it primarily depends on a good phoneme coverage. But most single speaker datasets i know provide a minimum of 16 hours (or more) of voice recordings.
Additionally this might help you:
1 Like
erogol
(Egolge)
February 18, 2021, 4:22pm
9
I feel like bargaining but at least 5 hours is like a good value to fine-tune a pre-trained model.
erogol
(Egolge)
February 18, 2021, 5:25pm
11
let me check and thx for sharing.
Alesh
(Aleš H)
May 8, 2023, 2:57am
12
Hey, anybody finished the czech model? Thinking about doing it myself and would love to work on it with somebody.
skamos
(skamos@centrum.cz)
October 20, 2023, 10:54am
13
I would like to cooperate with you on the model. Just don’t know what to do.
I would like to try and make the Czech model with you. If you’d like please hit me up on matej@aoo.cz