I’ve discovered two sentences that seem to make the model loop endlessly. I’ve been using https://github.com/synesthesiam/docker-mozillatts/, so I don’t have a simpler Python MWE at the moment, sorry. We had some discussion in an issue as well.
The two sentences (so far) are:
you whiskey. uWSGI
Which loops endlessly on “GI gi gi gi gi gi gi” at the end, and
Some important terms you should know.
Which loops on the “know” at the end, giving the odd effect of the computer telling you “no no no” and making me feel rather bad for asking the TTS to generate these sounds.
I’ve noticed that making any change to these sentences avoids the issue. In my project I’ve worked around it for now: if the generated audio is longer than 20 seconds, I append a ‘.’ to the end of the text and re-generate.
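Roughly, the workaround looks like the sketch below. It is only an illustration under some assumptions: `synthesize` is a hypothetical callable that takes text and returns WAV bytes (it is not a function from Mozilla TTS or docker-mozillatts), and the helper names and the single retry are my own choices. The 20-second threshold is the one mentioned above.

```python
import io
import wave
from typing import Callable

MAX_SECONDS = 20.0  # audio longer than this is treated as a runaway loop


def wav_duration_seconds(wav_bytes: bytes) -> float:
    """Return the duration of an in-memory WAV file in seconds."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def synthesize_with_retry(
    text: str,
    synthesize: Callable[[str], bytes],  # hypothetical: text in, WAV bytes out
    max_seconds: float = MAX_SECONDS,
    max_retries: int = 1,
) -> bytes:
    """Synthesize text; if the result is suspiciously long, append '.' and retry."""
    audio = synthesize(text)
    for _ in range(max_retries):
        if wav_duration_seconds(audio) <= max_seconds:
            break
        # Any small change to the input seems to avoid the loop,
        # so append a period and generate again.
        text = text + "."
        audio = synthesize(text)
    return audio
```

This only detects the loop after the fact by checking the output length, so it does not stop the model from looping in the first place, and a legitimately long sentence would also trigger a retry.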
Is there anything on my end that I should be doing differently to avoid this? Model parameters or so?