Hi,
Would it be a good idea to read some pages of Wikipedia and give the audio?
What’s the procedure? Where to upload the data? Available for Italian?
Thanks,
A.
Hi,
Would it be a good idea to read some pages of Wikipedia and give the audio?
What’s the procedure? Where to upload the data? Available for Italian?
Thanks,
A.
Thanks for starting this thread. The voice.mozilla.org sentence corpus already includes data from Wikipedia. We’ve imported a maximum of 3 sentences per article (as per legal requirement) and they will show up when contributing your voice.
Thanks for your answer. So what written long(er) sources can I read in Italian?
@a.mascitti you might also be interested in https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Spoken_Wikipedia, the project for recording audio versions of Wikipedia articles.
Another project that could be interesting for you is librivox, audiobooks under public domain: https://librivox.org/
People use both projects to train speech recognition software, so this could also help Common Voice indirectly in the future.
It would be interesting if contributors could link a Wikimedia / Librivox account to their CV account to algorithmically compare the long form and clip-based audio samples.