Weird Audio Generated

haqkiemdaim · July 9, 2019, 12:48pm

Hi there! Would like to ask a question regarding the issue on the title.

Before that, below is my dataset specification:

° a total of 5 hours dataset
° contain >30 seconds for most of the audios

When the model is ready, i tried to test it with few words that has the highest frequency in my corpus:

I picked the highest word ‘saya’ and the audio generated very long for only 1 word.

Only the first 1 seconds can hear the word ‘saya’ and the next few seconds became unknown sound.

Also during testing, i was getting below error:

(Stopped with decoder…)

2 question. What could be the reason of me having that kind of mentioned audio ?

AND

is the stopped decoder issue due to long sentence audio (dataset) ?

erogol · July 9, 2019, 4:41pm

your attention or stopnet does not work properly so network reaches the maximum iteration.

haqkiemdaim · July 9, 2019, 6:57pm

I’m sorry but what caused for the issue you mentioned ya ?

erogol · July 10, 2019, 9:33am

cannot tell without knowing more. You need to find out yourself. Ya!

Topic		Replies	Views
Audio generated with TTS is a Bip TTS (Text-to-Speech) learning	4	2129	March 10, 2021
Noise in audio files TTS (Text-to-Speech)	2	368	December 14, 2020
[TWEB dataset] TestSentence audio is progressing while synthesized audio is noisy TTS (Text-to-Speech)	1	307	July 28, 2020
Sentences which trigger an endless loop TTS (Text-to-Speech)	10	1392	December 17, 2020
Text produced has long strings of words with no spaces DeepSpeech	22	4013	April 30, 2018