Common Voice Localisation Overhaul

gina · August 7, 2024, 10:59am

Hi everyone

A few weeks ago, we announced our plans to overhaul Common Voice localization. I’m pleased to share that the process is now complete, and you can find more details in our latest blog post here.

To summarize: The overhaul of Common Voice localization has significantly reduced the workload required for communities to start collecting data. Previously, localizing 824 strings across the entire user interface was required, but now only 300 core strings need to be localized. This change allows communities to begin data collection earlier and more efficiently, accommodating a range of data collection goals, from fine-tuning existing models to training new ones. The goal is to make Common Voice more accessible to languages with limited resources, particularly low-resourced languages.

Thanks
Gina

Topic		Replies	Views
Streamlining Localization and Reducing Barriers for Common Voice Communities Common Voice	3	572	May 22, 2024
Welcome to 2024 – A New Year of Voice Contributions! Common Voice	0	519	January 17, 2024
Last Weekly Update 2021: Celebrations Common Voice	0	1116	December 17, 2021
Common Voice mid-year release - more data, more languages! Common Voice announcements , dataset	20	2488	August 12, 2019
Common Voice Dataset Release - Mid Year 2020 Common Voice announcements	16	24145	August 21, 2020

Common Voice Localisation Overhaul

Related topics