I’m currently working on a paper at university that tries to assess quality and results with the Common Voice German dataset (clips.tsv.zip, de.zip), but since yesterday these files are no longer on your S3 storage.
Instead, other files such as “de.tar.gz” are now listed, and they are really small in size…
Is there any chance the old files will come back soon, or that another complete, updated dataset is coming up?
I received a mail from Lindsay containing links to the new speech dataset (release date: 2019-02-13). However, it only contained the newly released audio data. Is there a way to also get the updated clips.tsv.zip?
Hmm… any chance I can get the old clips.tsv.zip file again?
It still seems to be unavailable at the moment.
I want to continue my work on CorporaCreator, but without any data, I cannot do so…