DRAFT GUIDELINES FOR REVIEWING RECORDINGS
Thank you to all the Community Members that took part in developing the guidelines. The guidelines are now available on Common Voice. For support in localising the page please read this comment.. - From @heyhillary
[Edited to include comments up to 2 April 2019]
Problems with the recording
Misreadings
You need to check very carefully that what has been recorded is exactly what has been written - reject if there are even minor errors. Very common mistakes include:
- missing out âAâ or âTheâ at the beginning of the recording.
- missing an âsâ at the end of a word.
- Reading contractions that arenât actually there, such as âWeâreâ instead of âWe areâ, or vice versa.
- Missing the end of the last word by cutting off the recording too quickly.
- Taking several attempts to read a word.
For example:
-
The giant dinosaurs of the Triassic.
-
Giant dinosaurs of the Triassic.
[âTheâ omitted] -
The giant dinosaur of the Triassic.
[Should be âdinosaursâ] -
The giant dinosaurs of the Triassi-.
[Recording cut off before the end of the last word] -
The giant dinosaurs of the Triassic. Yes.
[More has been recorded than the required text] -
The giant dinosaurs of the Tri- Triassic.
[The first âTri-â is not in the written text] -
We are going out to get coffee.
-
Weâre going out to get coffee.
[Should be âWe areâ] -
We are going out to get a coffee.
[No âaâ in the original text]
Varying pronunciations
Be cautious before rejecting a clip on the ground that the reader has mispronounced a word, has put the stress in the wrong place, or has apparently ignored a question mark. There are a wide variety of pronunciations in use around the world, some of which you may not have heard in your local community. Please provide a margin of appreciation for those who may speak differently from you.
On the other hand, if you think that the reader has probably never come across the word before, and is simply making an incorrect guess at the pronunciation, please reject. If you are unsure, use the skip button.
-
On his head he wore a beret.
[âBeretâ is OK whether with stress on the first syllable (UK) or the second (US)] -
His hand was rais-ed.
[âRaisedâ in English is always pronounced as one syllable, not two]
Background noise
We want the machine learning algorithms to able to handle a variety of background noise, and even relatively loud noises can be accepted provided that they donât prevent you from hearing the entirety of the text. Quiet background music is OK; music loud enough to prevent you from hearing each and every word is not.
-
{Sneeze} The giant dinosaurs of the {cough} Triassic.
-
The giant dino {cough} the Triassic.
[Part of the text canât be heard]
If the recording breaks up, or has crackles, reject unless the entirety of the text can still be heard:
-
{Crackle} giant dinosaurs of {crackle} -riassic.
[Part of the text canât be heard]
Background voices
A quiet background hubbub is OK, but we donât want additional voices that may cause a machine algorithm to identify words that are not in the written text. If you can hear distinct words apart from those of the text, the clip should be rejected. Typically this happens where the TV has been left on, or where there is a conversation going on nearby.
- The giant dinosaurs of the Triassic. [read by one voice] Are you coming? [called by another]
Volume
There will be natural variations in volume between readers. Reject only if the volume is so high that the recording breaks up, or (more commonly) if it is so low that you canât hear what is being said without reference to the written text.
Reader effects
Most recordings are of people talking in their natural voice. You can accept the occasional non-standard recording that is shouted, whispered, or obviously delivered in a âdramaticâ voice. Please reject sung recordings and those using a computer-synthesized voice.
Problems with the written text
Please see Discussion of new guidelines for uploaded sentence validation
Still unsure?
If you come across something that these guidelines donât cover, please vote according to your best judgement. If you really canât decide, use the skip button and go on to the next recording.