We’re the Cantonese Computational Linguistics Infrastructure Development
Workgroup (CanCLID), a team of volunteers from Guangdong, Guangxi, Hong
Kong, Macau, and the United States. We want to add the Cantonese language to Common Voice.
Which language code should be used?
Which script should be used?
Cantonese can be written in both Hant (Han, Traditional) and Hans (Han, Simplified). We recommend adding Hant first, because our volunteers are more familiar with it and more proficient in it.
Hans text can usually be generated from Hant text by mechanical transliteration. If necessary, we can also provide manually checked conversions.
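As a rough illustration of what mechanical transliteration from Hant to Hans could look like, here is a minimal Python sketch that replaces characters one by one. The table `HANT_TO_HANS` and the function `to_hans` are hypothetical names made up for this example, and the table holds only four entries; a usable converter would need a far larger, carefully checked table.

```python
# Hypothetical, deliberately tiny mapping table for illustration only.
# A real converter would need thousands of checked entries.
HANT_TO_HANS = {
    "廣": "广",
    "東": "东",
    "話": "话",
    "講": "讲",
}


def to_hans(text: str) -> str:
    """Replace each character that has a table entry; keep all others unchanged."""
    return "".join(HANT_TO_HANS.get(ch, ch) for ch in text)


if __name__ == "__main__":
    print(to_hans("我哋講廣東話"))
```

Characters without an entry, such as the Cantonese-specific 哋, pass through unchanged. Cases where a single character is not enough to decide the correct output are the ones where manually checked conversions would be needed.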
An issue was also opened here: https://github.com/mozilla/common-voice/issues/2926.