I’ve come across recorded sentences with text to speech. Should I vote them positively or not?
Welcome to the community discourse!
This is interesting, I haven’t found this situation, did you have the chance to document which sentences were using this?
I would say that this is not ideal, since this is the same voice over an over again, so having more than 15 minutes of this voice is not super helpful. We really need at least 1000 different and diverse voices for each language, and definitely this is not very diverse.
No I didn’t document it but I’ll do it from now on. This happened about 3 times so far in the 55 sentences I voted for.
If the voice is indeed synthetic the clip should be marked as invalid, and I agree with @nukeador that…
I’ve added this to the draft reviewing guidelines, here:
Hi, so people are recording TTS clips, I’m rejecting them since it doesn’t make sense to have them in the dataset. I’m worried this will slow down the validation process of actual clips.
@Codigo_Logo_Programacao_e_Inteligencia_Artificial how many of these have you found?
@nukeador About 10 in a set of 150 clips.
@gweber is there a way we can help people identify and flag these ones so we can identify who is sending these?
Unfortunately we don’t have flagging functionality yet, though it’s been requested a couple of times already.