I don’t know if it’s possible to look up individual clips, but this is one of the clips that I never know what to do with. It’s quiet but still barely audible if I increase my volume.
I have had a decent number of clips from this user.
I’m very new to the project but was also thinking along similar lines. Do you know if each participant has a fixed ID, so that it would be possible to filter out consistently quiet or poor-quality recordists?
A contributor keeps the same ID if they are logged in, or if they keep using the same device/browser while logged out or unregistered. In those cases you can pinpoint problematic voices from the dataset releases (not on the website). The client_id field in the .tsv files is where you should look.
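To make the client_id idea concrete, here is a minimal sketch of grouping clips by contributor and dropping the ones you have decided to exclude. It assumes a tab-separated file with client_id and path columns, as in the dataset releases; the sample rows, the IDs, and the helper names clips_by_client and drop_clients are made up for illustration.

```python
import csv
import io
from collections import defaultdict


def clips_by_client(tsv_file):
    """Group clip paths by client_id from an open .tsv file object."""
    # QUOTE_NONE: sentences may contain quote characters that are not CSV quoting.
    reader = csv.DictReader(tsv_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    grouped = defaultdict(list)
    for row in reader:
        grouped[row["client_id"]].append(row["path"])
    return dict(grouped)


def drop_clients(tsv_file, blocked_ids):
    """Return all rows whose client_id is NOT in blocked_ids."""
    reader = csv.DictReader(tsv_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    return [row for row in reader if row["client_id"] not in blocked_ids]


# Hypothetical sample data; real files have more columns and long hashed IDs.
SAMPLE = (
    "client_id\tpath\tsentence\n"
    "abc\tclip_1.mp3\tHello there.\n"
    "xyz\tclip_2.mp3\tGood morning.\n"
    "abc\tclip_3.mp3\tSee you later.\n"
)

if __name__ == "__main__":
    print(clips_by_client(io.StringIO(SAMPLE)))
    kept = drop_clients(io.StringIO(SAMPLE), blocked_ids={"abc"})
    print([row["path"] for row in kept])
```

With a real release you would pass an opened file instead of the in-memory sample, for example open("validated.tsv", encoding="utf-8", newline="").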
Low voice levels happen sometimes. At least in my case, the person is speaking at night, e.g. in a dorm, so that nobody gets disturbed.
Very low energy levels can be problematic. In one case I had to remove a very quiet recording to be able to create a model. Common Voice is targeting natural speech, and such recordings are not so natural (a quiet whisper). E.g., Whisper checks/eliminates such problematic ones.
In my opinion, if you have problems understanding a clip at a reasonable volume setting, you should invalidate it. If you have problems understanding, the DL model will also have problems. But that’s me…
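If you want to screen for very low energy levels yourself, one rough way is to measure the RMS level of each clip in dBFS and flag anything below a cutoff. The sketch below assumes the audio is already decoded to float samples in the range -1.0 to 1.0 (decoding the MP3 files is not shown), and the -40 dBFS threshold is an arbitrary illustrative value, not something Common Voice or any toolkit prescribes.

```python
import math


def rms_dbfs(samples):
    """RMS level of float samples (range -1.0..1.0) in dB relative to full scale."""
    if not samples:
        return float("-inf")
    mean_square = sum(s * s for s in samples) / len(samples)
    if mean_square == 0:
        return float("-inf")  # digital silence
    return 10 * math.log10(mean_square)


def is_too_quiet(samples, threshold_db=-40.0):
    """Flag a clip whose RMS level is below the (assumed) threshold."""
    return rms_dbfs(samples) < threshold_db


def sine(amplitude, n=16000, freq=440.0, rate=16000):
    """Synthetic test tone standing in for decoded audio."""
    return [amplitude * math.sin(2 * math.pi * freq * i / rate) for i in range(n)]


if __name__ == "__main__":
    for amp in (1.0, 0.1, 0.001):
        tone = sine(amp)
        print(amp, round(rms_dbfs(tone), 1), is_too_quiet(tone))
```

A single whole-clip RMS value is crude, because long pauses pull the number down; it is only meant as a first pass before listening to the flagged clips.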
I have to repeat almost every second sentence after listening back, because it is too quiet, although it can be heard clearly. It will be good for repetition. I’m using Google Chrome on an Android phone. I started to read the sentences out loud only yesterday.
I usually don’t use my phone for recording, but I just tried it with my Samsung Galaxy s7e and it is working just fine. Neither my rather old phone nor Chrome has microphone sensitivity settings. On the Internet, I found some mention of disabling noise suppression in the settings, which is worth trying if the option exists.
Or maybe your speaker volume is too low, or you are holding your phone too far away (mine was ~25 cm away)?