Older English dataset question

makoto_wada_jp · June 15, 2021, 10:49am

@h_caulfield, @nukeador any information regarding the following will be greatly appreciated:

Is it possible to download this dataset? It seems that it is no longer available from Common Voice Datasets. Although Common Voice Corpus 1 (2019-02-25) is available for download, this version is different from your version since the file names for Common Voice Corpus 1 (2019-02-25) do not follow sample_#+.mp3 convention of your version.
Any luck with finding out overlaps since the file name convention has change? Just curious as to whether this has been resolved.

Lasty, although not related to this topic, I wish the Mozilla Discourse support “Accepted Answer” feature like in stackoverflow so that we can know whether it has been resolved. I do not see such feature mentioned in Who has which moderation powers on Discourse?. I guess Mozilla Discourse is different from Q&A type of community i.e. stackoverflow.

Topic		Replies	Views
Common Voice mid-year release - more data, more languages! Common Voice announcements , dataset	20	2507	August 12, 2019
Looking for Common Voice Corpus English before 2019-02-25 (v1) release Common Voice	6	851	June 21, 2021
Multi-Language-Dataset (Beta) is gone Common Voice issue , dataset	5	645	February 20, 2019
How to Access Old Release Version of Dataset? Common Voice dataset	0	601	September 7, 2020
Speaker ID split between train/test/dev Common Voice dataset	4	1013	February 15, 2019

Older English dataset question

Related topics