Hi, people are submitting text-to-speech (TTS) clips as recordings. I’m rejecting them since it doesn’t make sense to have them in the dataset. I’m worried this will slow down the validation of actual clips.
A post was merged into an existing topic: What if people are using text-to-speech to record?