|
Common Voice 21 dataset now available
|
|
4
|
340
|
May 14, 2025
|
|
Discrepancy in Hours Between Common Voice Datasets Page and Hugging Face Download
|
|
3
|
586
|
August 12, 2024
|
|
Load validated split Hugging Face data?
|
|
11
|
2804
|
June 5, 2024
|
|
Are librivox contributions really being put into Common Voice?
|
|
10
|
1066
|
September 7, 2023
|
|
I can't speak sentences in portuguese. There is no phrases for the language
|
|
3
|
985
|
August 31, 2023
|
|
Are splits currently still regenerated per version?
|
|
6
|
779
|
June 27, 2023
|
|
Dataset 13 release ๐
|
|
3
|
1574
|
March 20, 2023
|
|
Speaker IDS for Speaker Recognition
|
|
12
|
7812
|
October 29, 2022
|
|
Dialect metadata in the Armenian dataset
|
|
11
|
1966
|
October 12, 2022
|
|
Certain English Female Contributor Submitting False Data
|
|
4
|
1036
|
February 8, 2022
|
|
Accessing the extended version of a dataset
|
|
8
|
1560
|
December 6, 2021
|
|
Dataset Release AMA Thread (Active: 4th August 3-4pm UTC)
|
|
12
|
5017
|
August 19, 2021
|
|
Common Voice 2021 Mid-year Dataset Release!
|
|
8
|
2819
|
August 4, 2021
|
|
Older English dataset question
|
|
6
|
1482
|
June 15, 2021
|
|
Dataset versions
|
|
4
|
985
|
June 12, 2021
|
|
Could a specific collection be established for people with speech and/or communication disorders?
|
|
1
|
1357
|
May 28, 2021
|
|
Every sentence with character ะปั shoud be changed to ั in Serbian dataset
|
|
3
|
725
|
May 26, 2021
|
|
When will the new version of the data be available in June?
|
|
1
|
820
|
May 25, 2021
|
|
Very low download speed
|
|
4
|
1464
|
March 31, 2020
|
|
Book-reading mode (aka "ordered sentences collections")
|
|
3
|
1521
|
January 2, 2021
|
|
Explainations about reported.tsv, other.tsv, etc
|
|
0
|
1532
|
October 16, 2020
|
|
How to Access Old Release Version of Dataset?
|
|
0
|
602
|
September 7, 2020
|
|
.wav File Availability
|
|
1
|
1558
|
August 25, 2020
|
|
Empty string sentence in cv-corpus-5-2020-06-22/en/test.tsv
|
|
1
|
780
|
July 23, 2020
|
|
Upper Sorbian dataset download
|
|
6
|
1000
|
July 1, 2020
|
|
Native language in dataset
|
|
2
|
891
|
July 1, 2020
|
|
Read wikipedia and give audio?
|
|
5
|
1009
|
May 23, 2020
|
|
Data labels
|
|
1
|
1388
|
May 18, 2020
|
|
Separate by participants
|
|
1
|
681
|
May 17, 2020
|
|
Language Subsets
|
|
1
|
888
|
May 15, 2020
|