[TWEB dataset] TestSentence audio is progressing while synthesized audio is noisy

JDB · July 25, 2020, 9:45pm

Hello Mozillans!
I have been training a Tacotron2 model on the World English Bible (TWEB) dataset for over 25K steps now, and the test sentences show steady progress: TestSentence_3.zip (2.0 MB). However, the audio I synthesize myself is completely different (very noisy): He_is_your_father.zip (2.8 MB)
If anyone could shed some light on why the synthesized audio is of much poorer quality, I’d be grateful!

erogol · July 28, 2020, 9:46am

You need to adjust the audio parameters for this dataset specifically. It’s a male voice, and the default values would not work for it. Use the CheckSpectrograms notebook to find the right values.
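As a rough illustration of what checking audio parameters can look like, here is a minimal NumPy-only sketch. It is not the CheckSpectrograms notebook and does not use the TTS library: the function names are hypothetical, the parameter names `ref_level_db` and `min_level_db` and the dB normalization formula are assumptions based on a common Tacotron-style convention, and the 110 Hz sine is only a synthetic stand-in for a low-pitched voice. It reports how much of the normalized spectrogram gets clipped at the floor or ceiling for a given pair of values.

```python
import numpy as np

def stft_magnitude(wav, n_fft=1024, hop=256):
    """Magnitude STFT with a Hann window, shape (n_frames, n_fft // 2 + 1)."""
    window = np.hanning(n_fft)
    n_frames = 1 + (len(wav) - n_fft) // hop
    frames = np.stack(
        [wav[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1))

def normalize_db(mag, ref_level_db, min_level_db):
    """Assumed Tacotron-style normalization: amplitude -> dB -> [0, 1]."""
    db = 20.0 * np.log10(np.maximum(1e-5, mag)) - ref_level_db
    return np.clip((db - min_level_db) / -min_level_db, 0.0, 1.0)

def clipping_report(wav, ref_level_db, min_level_db):
    """Fraction of spectrogram bins clipped at the floor (0) and ceiling (1)."""
    norm = normalize_db(stft_magnitude(wav), ref_level_db, min_level_db)
    return {
        "floor": float(np.mean(norm == 0.0)),
        "ceiling": float(np.mean(norm == 1.0)),
    }

# Synthetic stand-in for a recording; replace with real samples in [-1, 1].
sample_rate = 22050
t = np.arange(sample_rate) / sample_rate
wav = 0.5 * np.sin(2 * np.pi * 110.0 * t)

# Compare two illustrative parameter pairs (values chosen arbitrarily).
for ref_db, min_db in [(20, -100), (40, -60)]:
    print(ref_db, min_db, clipping_report(wav, ref_db, min_db))
```

If a large share of bins sits at the floor or the ceiling for real recordings from the dataset, that would suggest the chosen values discard detail, which is one plausible way a mismatch between parameters and voice could show up.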
