Produce different voices for the same sentence

I wanted to know if there is a way to generate different voice samples for the same sentence using any of the pre-trained models?


Do you mean different speakers? If so, no, there is not.

Out of interest, for the same speaker, would the same text always produce the exact same audio output?

Is there a way to produce slight variations, in the same way that people don’t say a phrase in precisely the same way each time?

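You can enable the prenet dropout at inference. With dropout left on, each run zeroes a different random subset of prenet units, so the same text comes out slightly different each time.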
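To illustrate the idea, here is a minimal toy sketch in plain Python. It is not the library's actual code: `ToyPrenet`, its `dropout_at_inference` flag, and the layer sizes are all hypothetical names made up for this example. It only shows why keeping dropout active outside of training makes repeated runs on the same input differ.

```python
import random


class ToyPrenet:
    """Hypothetical stand-in for a decoder prenet: one linear layer, ReLU, dropout."""

    def __init__(self, in_dim, out_dim, p=0.5, dropout_at_inference=False, seed=0):
        # Fixed positive weights, standing in for a trained, frozen model.
        rng = random.Random(seed)
        self.weights = [
            [rng.uniform(0.1, 1.0) for _ in range(in_dim)] for _ in range(out_dim)
        ]
        self.p = p  # probability of dropping a unit
        self.dropout_at_inference = dropout_at_inference  # assumed flag name
        self.training = False  # we only look at inference here
        self._mask_rng = random.Random()  # unseeded: a fresh mask on every call

    def forward(self, x):
        # Linear projection followed by ReLU.
        hidden = [
            max(0.0, sum(w * v for w, v in zip(row, x))) for row in self.weights
        ]
        # Normally dropout is skipped at inference; the flag keeps it on.
        if self.training or self.dropout_at_inference:
            keep = 1.0 - self.p
            # Inverted dropout: zero a unit with probability p, rescale the rest.
            hidden = [
                h / keep if self._mask_rng.random() < keep else 0.0 for h in hidden
            ]
        return hidden


x = [0.2, 0.5, 0.9]

plain = ToyPrenet(3, 64)
print(plain.forward(x) == plain.forward(x))  # same input, same output

varied = ToyPrenet(3, 64, dropout_at_inference=True)
print(varied.forward(x) == varied.forward(x))  # a new random mask per call
```

In a real model the prenet output feeds the decoder at every step, so these small random differences carry through to the generated spectrogram. How the option is named and exposed depends on the model and version you are using, so check its config rather than relying on the names above.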