I gave a talk about Mozilla’s projects and especially about localization, Firefox, voice corpora (Common voice) … and their impact on the language especially minor/indiginous languages like Kabyle.
The meeting gathered language activists from Kabylia (Algeria), Rif (Morroco) and Basque (Spain and France).
Yes more collaboration between teams. A meeting in Tizi ouzou (Kabylia) is planned this year in Tizi ouzou with activists from Basque to talk about localization corpora, content…etc. See how to add the Rifain language and other North African Berber Languages (Some of them have no iso 639-2 alpha 3 code. They have ISO 639-3 codes)…
I think common Voice should be opened for ISO 639-3 languages and not only 639-2 alpha 3 languages.
We also talked about an opensource morphosyntactic corpus project. Maybe with Mozilla.
The sentence collector uses by default ISO 639-1/2. But we also have several languages enabled that are only part of ISO 639-3. If there is no blocker from the Common Voice side to use a ISO 639-3 language, I don’t think there is a problem adding it. @nukeador remind me, have we defined a process on how to request new languages?
1 Like
nukeador
(Rubén Martín [❌ taking a break from Mozilla])
6
We follow pontoon languages, once it’s created there, voice-web and the sentence collection tool follow the code there.