I’m currently developing another Common Voice Telegram bot, this time with Node.js, Request, and Telegraf.
I have some questions to ask:
- Does the production server prohibits API access from specific UA (e.g. curl command), and/or implements an API key / CSRF token for client apps?
- I have found a flaw in the API server where I can request data from unreleased languages (such as Bahasa Indonesia) to speak, submit, and review. Should I include them provided that some have included their sample sentences?
- Aside from Pontoon data in
./localization
, where can I get a list of (both released and unreleased) languages (as JSON) in Common Voice? It might be useful to identify the unreleased ones as in (2). - Does the Common Voice Sentences Collector webapp has a separate API?