TLDR: How could we change the 5000 sentence collection requirement to be more inclusive of a language community needs ? e.g low-resourced, not many speakers
At the contribute-athon sessions, we discussed some of the ideas the Common Voice team has for New Language Workflow for Common Voice.
I want to ensure as many people can be involved in this discussion so, I have created this topic.
The language workflow is the process in which a language joins the Common Voice website for voice data collection. See this comment to understand how it works.
We want to improve the language workflow this includes but is not limited to; centralising documentation by including and evolving the Community Playbook onto the Common Voice Website.
To help us we would like to listen to the community thoughts on two questions:
- How could we change the 5000 sentence collection requirement to be more inclusive of a language community needs ? e.g low-resourced, not many speakers
- What documentation did you wish you had or still need to support your language’s journey being launched onto Common Voice for voice data contributions ?
We look forward to hearing your thoughts !