We’re delighted to announce that the Common Voice 21 dataset is now available.
Common Voice now hosts 134 languages, with nearly 33,500 hours of speech from over 350,000 distinct speakers.
In this release, we’re delighted to welcome Norwegian Bokmål, one of the two official languages of Norway, the other being Nynorsk. Like many closely related languages, Nynorsk and Bokmål have different heritages. Bokmål, literally “book language”, is heavily influenced by Danish, dating from the period when Norway was a part of Denmark. Nynorsk, or “New Norwegian”, is spoken more in the western and rural parts of Norway, while Bokmål is spoken mainly in urban and eastern areas. A big “hei” to all our Bokmål contributors!
A huge thank you to all the data contributors, language leads and communities for making this possible.