Hi @erogol and Mozilla TTS members,
We already have a decent quality and fast TTS for conversational applications using Tacotron2 + Melgan architecture.
We now want a very high quality TTS wherein synthesis time is not a constraint at all. Please let us know which pipeline will be most suitable
Also, what all parameters we can tune to make it high quality. Like shall we increase the sampling rate to higher value than 22kHz. Shall we increase the melbins to 160 instead of 80.
Does multispeaker data helps in achieving this. If so, which pipeline?