What if people are using text-to-speech to record?

feedback
#1

I’ve come across recorded sentences with text to speech. Should I vote them positively or not?

I came across several TTS clips
Discussion of new guidelines for recording validation
(Rubén Martín) #2

Hi @DaDiRa

Welcome to the community discourse! :slight_smile:

This is interesting, I haven’t found this situation, did you have the chance to document which sentences were using this?

I would say that this is not ideal, since this is the same voice over an over again, so having more than 15 minutes of this voice is not super helpful. We really need at least 1000 different and diverse voices for each language, and definitely this is not very diverse.

1 Like
#3

No I didn’t document it but I’ll do it from now on. This happened about 3 times so far in the 55 sentences I voted for.

(Lissyx) #4

I’m pretty sure this is something we already discussed about with @kdavis and the answer was a clear no as much as I can recall. Not only it’s going to not be very good for the dataset, but chances are that this is against the terms of use of the Text-to-Speech service.

2 Likes
(kdavis) #5

If the voice is indeed synthetic the clip should be marked as invalid, and I agree with @nukeador that…

1 Like
(Michael Maggs) #6

I’ve added this to the draft reviewing guidelines, here:

1 Like
(Pedro Lima) #7

Hi, so people are recording TTS clips, I’m rejecting them since it doesn’t make sense to have them in the dataset. I’m worried this will slow down the validation process of actual clips.

(Rubén Martín) #8

@Codigo_Logo_Programacao_e_Inteligencia_Artificial how many of these have you found?

(Pedro Lima) #9

@nukeador About 10 in a set of 150 clips.

(Rubén Martín) #10

@gweber is there a way we can help people identify and flag these ones so we can identify who is sending these?

(Gregor) #11

Unfortunately we don’t have flagging functionality yet, though it’s been requested a couple of times already.