Topics tagged dataset

Topic	Replies	Views	Activity
Common Voice mid-year release - more data, more languages! Common Voice announcements , dataset	21	2063	August 12, 2019
Portuguese dataset Common Voice dataset	2	944	August 1, 2019
Is dataset of acoustic model subset of dataset of language model? DeepSpeech dataset	2	380	August 1, 2019
How can one download the German dataset? Common Voice dataset	4	846	June 12, 2019
Downloading 20gb is tough on weak networks. Alt download method? Common Voice dataset	3	747	June 12, 2019
Add Basque to the dataset page Common Voice dataset	7	854	June 12, 2019
Dataset downloads Dutch Common Voice dataset	5	1054	June 12, 2019
Dataset releases - What's more valuable for you? Common Voice feedback , dataset	10	1852	June 12, 2019
Subpar data uses Common Voice dataset	8	1207	June 5, 2019
Importing large annotated database of CC0 speech data in Swedish? Common Voice sentence-collection , dataset	3	553	May 28, 2019
Common Voice datasets (Mandarin zh-tw) Common Voice dataset	3	728	May 23, 2019
Privacy concerns about dataset metadata Common Voice dataset	8	2351	May 16, 2019
What is the ideal decibel? Do we need to adjust volume of datasets? DeepSpeech dataset	3	444	May 11, 2019
Fine tuning data requirements DeepSpeech dataset	6	2187	May 11, 2019
Add in dataset Sakha language Common Voice dataset	6	956	April 25, 2019
Rejected audio dataset Common Voice dataset	3	584	April 5, 2019
Pre Release Data vs Latest Release Data Common Voice dataset	2	395	April 2, 2019
Gender breakdown of English language dataset Common Voice feedback , dataset	6	1413	March 25, 2019
En 22G dataset, problems about 'path' in .tsv files Common Voice dataset	10	636	March 15, 2019
What are the rules behind 'path' ID generation? Common Voice dataset	1	351	March 11, 2019
Sharing Common Voice Through peer-to-peer Common Voice dataset	17	1554	March 11, 2019
How are the dev/test/train datasets split? Common Voice dataset	5	2142	March 7, 2019
Zero byte files in German language set (new official release) Common Voice issue , dataset	3	434	March 2, 2019
Filtering a specific word / sentence Common Voice dataset	1	308	February 28, 2019
Multi-Language-Dataset (Beta) is gone Common Voice issue , dataset	6	572	February 20, 2019
Speaker ID split between train/test/dev Common Voice dataset	5	832	February 15, 2019
Stats about Common Voice: Kabyle Corpus Common Voice dataset	3	491	February 7, 2019
Timeline for releasing the DeepSpeech models trained with the Common Voice data Common Voice dataset	2	1218	June 23, 2018
Common Voice v1 corpus design problems, overlapping train/test/dev sentences Common Voice dataset	3	1926	April 3, 2018
Sharing the dataset Common Voice dataset	4	1228	November 22, 2017