@h_caulfield, @nukeador any information regarding the following will be greatly appreciated:
-
Is it possible to download this dataset? It seems that it is no longer available from Common Voice Datasets. Although Common Voice Corpus 1 (2019-02-25) is available for download, this version is different from your version since the file names for Common Voice Corpus 1 (2019-02-25) do not follow sample_#+.mp3 convention of your version.
-
Any luck with finding out overlaps since the file name convention has change? Just curious as to whether this has been resolved.
Lasty, although not related to this topic, I wish the Mozilla Discourse support “Accepted Answer” feature like in stackoverflow so that we can know whether it has been resolved. I do not see such feature mentioned in Who has which moderation powers on Discourse?. I guess Mozilla Discourse is different from Q&A type of community i.e. stackoverflow.