You are asking wonderful questions for a new contributor, @elearningbakery.
Q1: A new FULL version for ro will be published regardless of VOICE contributions. If there are no contributions, it will be the same as the previous one, except for some metadata (e.g. new sentences, newly reported sentences, new validations, etc.). A DELTA version is only released if a NEW VOICE contribution is made in that 3-month period. I have an open-source script to merge a previous FULL version with a new DELTA, which saves bandwidth and disk space and is especially useful for large datasets.
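To give a rough idea of what merging a FULL version with a DELTA involves on the metadata side, here is a minimal sketch. It is not the script mentioned above. The column names (`path`, `sentence`, `up_votes`), the sample rows, and the helper names `read_tsv` and `merge_tsv` are assumptions made for illustration, and a real merge would also need to copy the new audio clips.

```python
import csv
import io


def read_tsv(text):
    """Parse tab-separated text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text), delimiter="\t"))


def merge_tsv(full_rows, delta_rows, key="path"):
    """Merge metadata rows; a DELTA row replaces a FULL row with the same key.

    Assumption: each clip is uniquely identified by the `key` column.
    """
    merged = {row[key]: row for row in full_rows}
    for row in delta_rows:
        merged[row[key]] = row  # new clip, or updated metadata for an old one
    return list(merged.values())


# Hypothetical sample data, not taken from a real release.
full = read_tsv(
    "path\tsentence\tup_votes\n"
    "a.mp3\tBuna ziua\t2\n"
    "b.mp3\tMultumesc\t1\n"
)
delta = read_tsv(
    "path\tsentence\tup_votes\n"
    "b.mp3\tMultumesc\t3\n"
    "c.mp3\tLa revedere\t2\n"
)

merged = merge_tsv(full, delta)
for row in merged:
    print(row["path"], row["up_votes"])
```

Keying on the clip path keeps the merge idempotent: applying the same DELTA twice gives the same result, so only the small DELTA archive has to be downloaded each quarter.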
Q2: MCV is for dataset creation, not for AI models and the applications that use them. You should look around (e.g. HuggingFace, OpenAI, NVIDIA, Kaggle, etc.) to see whether somebody has worked on Romanian. Here, we have (or try to have) language communities, which can run local/global campaigns, do some fund-raising, give tokens of appreciation, etc. For example, I'm currently helping the Circassian languages (ady & kbd, minority languages in Turkey) and trying to build communities, give training, design and implement campaigns, etc. (search Google for “#CommonVoice” “#Circassian”). There is a COMMUNITIES.md document where people post such meeting points if they have them, and here and here are some of my views on communities.
I personally created a 3D Voice-Chess application in the past, when I was managing a campaign for the Turkish dataset, to show contributors how nicely their contributions can help AI (it is old and non-functional now).
Q3: Multi-part question, I’ll answer that below…
Q4: Yep, Pontoon. Make sure you have also joined the Matrix channel for Common Voice to get quick support/answers. Here are Managers/Translators for Romanian. If your suggestions do not pass, try to contact them; otherwise write to the Matrix channel and the team will help you.