A lot of people have been asking us through Discourse and other mediums like Slack about when Common Voice will be available in their language. Well, this update is for all of you!
First, the big news: we are aiming to launch multi-language Common Voice by the week of May 7.
However, there is still a lot of work to do before we can collect voices in any new language, and we need your help!
The first part we need help with is translating the content of the website. Some of you may have noticed that this work has already begun in Pontoon, Mozilla’s tool used to help translate Firefox and Mozilla.org. If you would like to join the translation effort for Common Voice, or add a new language, please reach out to me at mikey(at)mozilla.com.
Second, we will need sentences in these new languages for people to read into Common Voice. This part is a little more tricky, as there are many considerations for creating a complete group of sentences for people to read. That is why we are working with a group of language and speech experts (which we’re calling the Common Voice Speech Advisory Group) to make some guides for collecting and/or writing these sentences. We expect to have that guide up in a public location by the end of April.
In the meantime, you can help us look for sentences to read in your language by searching for public domain texts. Possible sources include copyright free material like government proceedings, movie or drama scripts, perhaps radio or podcast transcripts, etc. We hope to have more information soon about how to look for this public material, but in the meantime feel free to use this thread to ask questions and discuss places to look.
Lastly, we would like to thank everyone for their interest and help with the Common Voice project so far. Without you, this project couldn’t exist. Already we are seeing the English Common Voice data being used in speech engines and university research. We hope that by going multi-lingual we will empower whole new communities to take part in voice technology. Help us make that happen!
With more to come soon,
Michael & the Common Voice team