Hey,
Thanks for creating this discussion topic, and thanks to everyone for their input.
As has already been suggested, you might want to reach out to the contributors who set up Arabic on Common Voice to share learnings. You can see their contact details on Pontoon.
Regarding Language and Accent overall on Common Voice
We want to design a holistic approach to languages and accents that can work across communities. Following community feedback about the current challenges, this is a priority for the 2021/22 roadmap (see the post on the August open sessions to engage with this!). The team is starting to gather input and insights from research scientists, ML engineers, linguistic experts, and community members to map out new language workflows and accent capture mechanisms. These will be opened up to the community for discussion and user testing, so keep an eye out for those posts!