Bad cases or artifacts for Tacotron2 + Melgan vocoder, Any suggestions?

guang · March 21, 2022, 12:25pm

I have trained a Melgan vocoder using my own data, but when its used for end-to-end TTS, some of the synthesized results (about 3% utterances）has some artifacts (noise). In details, the mel-spectrum in corresponding ares discontinuous, shown as follows:

Any suggestions to improve the this?

Topic		Replies	Views
Any success with mel-gan? TTS (Text-to-Speech)	0	411	January 15, 2020
No output on generating voice TTS (Text-to-Speech)	13	2307	November 7, 2020
Tacotron2 + PWGAN produces Deep/Muffled Voice TTS (Text-to-Speech)	9	2989	June 7, 2021
Training russian TTS TTS (Text-to-Speech)	9	7054	March 11, 2021
Tacotron2 and multiband-melgan as vocoder TTS (Text-to-Speech)	4	910	December 12, 2020

Bad cases or artifacts for Tacotron2 + Melgan vocoder, Any suggestions?

Related topics