Hi there! Would like to ask a question regarding the issue on the title.
Before that, below is my dataset specification:
° a total of 5 hours dataset
° contain >30 seconds for most of the audios
When the model is ready, i tried to test it with few words that has the highest frequency in my corpus:
I picked the highest word ‘saya’ and the audio generated very long for only 1 word.
Only the first 1 seconds can hear the word ‘saya’ and the next few seconds became unknown sound.
Also during testing, i was getting below error:
(Stopped with decoder…)
2 question. What could be the reason of me having that kind of mentioned audio ?
AND
is the stopped decoder issue due to long sentence audio (dataset) ?