Speaker encoder used with release MultiTTS models

htu · November 17, 2020, 3:47pm

Hi ,

I am looking to use the released Multi-Speaker-Tacotron2 model for a project, but I am unsure which speaker encoder was used to generate the training data for the model.

Is it the one that is downloaded in the sample notebook and used to clone a voice, or is it the released Speaker Encoder model? Or is it a different encoder I can find somewhere else?

(I guess this question could also be rephrased as which encoder created the speakers.json for the released model?).

Thanks,
Henry

erogol · November 20, 2020, 12:36pm

just use the one in the notebook