Differences between data from the Hugging Face dataset and the downloaded dataset?

@jesslynnrose Thank you for your answer. Sad to hear that there's no official way, so to speak, to confirm whether the rehosted versions are the same.

@bozden Thank you very much for investigating the dataset as well. Maybe I should have clarified that I had already checked on Common Voice 8 whether the two sources contain identical information, but I did so using pandas DataFrames and I wasn't sure my result was fully correct. So I wanted a sanity check, and I am happy to see that, at least for CV 13, the two sources have the same files. Cheers!
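
For anyone who wants to repeat the pandas check, here is a rough sketch of the kind of comparison I mean. It is not an official verification method. The helper name `compare_sources`, the toy data, and the choice of `path` and `sentence` as comparison columns are my own assumptions; in practice you would load the downloaded TSV and the Hub split into the two DataFrames yourself.

```python
import pandas as pd


def compare_sources(left: pd.DataFrame, right: pd.DataFrame, keys: list[str]) -> pd.DataFrame:
    """Return the rows that appear in only one of the two tables.

    Hypothetical helper: the comparison ignores row order and only looks
    at the columns listed in `keys`.
    """
    # Cast to str so dtype differences between loaders do not cause false mismatches.
    left_k = left[keys].astype(str).drop_duplicates()
    right_k = right[keys].astype(str).drop_duplicates()
    # An outer merge with indicator=True labels each row as
    # "left_only", "right_only", or "both".
    merged = left_k.merge(right_k, on=keys, how="outer", indicator=True)
    return merged[merged["_merge"] != "both"]


# Toy stand-ins for the two sources (assumed column names, made-up rows).
mozilla = pd.DataFrame(
    {"path": ["a.mp3", "b.mp3", "c.mp3"], "sentence": ["one", "two", "three"]}
)
hub = pd.DataFrame(
    {"path": ["c.mp3", "a.mp3", "b.mp3"], "sentence": ["three", "one", "two"]}
)

diff = compare_sources(mozilla, hub, keys=["path", "sentence"])
print("identical" if diff.empty else diff)
```

An empty result only says the chosen columns match. It says nothing about the audio files themselves, so comparing file checksums would still be a separate step.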