Release of Latvian dataset

As you might know, there have been significant contributions to the Latvian language in Mozilla Common Voice in recent months. When can we expect a newly released dataset?

It’s been so exciting to see the Latvian contributions coming in! I also can’t wait. We should see these new contributions in the dataset release due in late June/early July.

The dataset was published on the 28th of June.

Question regarding this page:

The languages page states that only 48% has been validated, yet for the past few days the mentioned page has not offered me any entries to validate, only new ones to speak. Is everything working fine?

Generally yes, but Latvian did hit one wall. There is a PR on GitHub to overcome this. Please also read my comment there about the reason:
