Unfortunately the downloadable datasets don’t have an (visible) publication date.
At the moment I’m wondering how up to date the Dutch dataset is because the download page gives the following stats:
Size 382 MB
Validated Hr. Total 12
Overall Hr. Total 13
Number of Voices 373
However, when I download this dataset i only end up with 366MB.
Thanks for the pointer !
It thought it would be an easy feat, but it clearly is not.
I do still have some questions after reading that entry:
Are the datasets for all languages revisited at the same time, or independent ?
Is there a way to help for the Dutch one ?
nukeador
(Rubén Martín [❌ taking a break from Mozilla])
4
We haven’t agreed on a plan yet. I’ll be working on a proposal to deliver to the team so we can have sooner dataset releases based on what’s more helpful for the community. I’ll open a topic about it soon.
1 Like
nukeador
(Rubén Martín [❌ taking a break from Mozilla])
5
Today we have released a new version of the dataset and keep improving the automation of the process.