nukeador
(Rubén Martín [❌ taking a break from Mozilla])
June 12, 2019, 11:18pm
22
Today we have released a new version of the dataset and keep improving the automation of the process.
The Common Voice Team is excited to announce the release of a new dataset that includes 2,366 total hours of contributed voice data!
The project has seen a spike in contributions and launches of many new languages over the past six months. We want to make sure to release data for use by the community quickly and efficiently. To do this, we’ve moved forward with a mid-year release including all recorded clips in 28 languages, available on the Datasets page on Common Voice .
The new languages bei…
Hello,
I’ve tried to access the form link but it seems not to be accepting responses anymore. Can you please help me about it?
nukeador
(Rubén Martín [❌ taking a break from Mozilla])
April 6, 2020, 10:52am
24
HI, this review is no longer needed, the final dataset was published on
https://voice.mozilla.org/datasets
Thank you.
I wanted to use Corpora Creator with clips.tsv but it seems that the audio files are named differently in the Common-Voice dataset. So, how can I re-create the train dev and test tsv files?