Common Voice 22.0 release 🎉

We’re excited to announce that the 22nd Common Voice dataset release is now available for download here.

Common Voice 22.0 adds 281 hours of speech data, bringing the total to 33,815 hours. This release also adds 296 newly validated hours, for a total of 22,640 validated hours of clips. It welcomes three new languages: Aromanian (rup), Tajik (tg), and Venda/Tshivenda (ve). This brings the number of languages available in this release to 137.

We’re all so proud to support such dedicated language communities. Thank you all so much.
