More Data More Languages - Dataset Release

The Common Voice team is thrilled to announce the release of a new dataset for Catalan, Abkhaz, and Esperanto :tada:. You can check out the newly released data by visiting the provided link. The team thanks the community and everyone contributing to the Common Voice mission.


Thanks, great news!
Just to clarify the headline a little: this is not a release of the Common Voice dataset for these languages (the big one with audio files), but an addition to the sentence corpus, which is released to the website.
