I was wondering whether there is a way to download, for example, the Dutch and German Common Voice datasets and find out whether any recording exists in both languages. For example, could I take a recording from the Dutch dataset and check whether its translation is available in German?
I’m not sure how much this helps you, but at https://www.github.com/mozilla/common-voice/tree/main/server%2Fdata%2Fde you can find all the sentences that may be in the German dataset (no guarantee that they have been recorded yet, though). You should be able to find the Dutch sentences in a similar way. Other than that, I’m afraid you won’t have any choice other than to download the datasets for both languages and look through them yourself.
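If you do end up downloading both datasets, here is a rough sketch of how the comparison could look in Python. It rests on a few assumptions, so treat it as a starting point rather than a finished tool: I'm assuming each download contains a tab-separated file such as `validated.tsv` with `sentence` and `path` columns, and that you supply your own Dutch-to-German translation table (the `translations` dict below is hypothetical; you would have to build it yourself). The function names are made up for illustration.

```python
import csv


def load_sentences(tsv_path):
    """Map each recorded sentence to the list of clip paths that contain it.

    Assumes a tab-separated file with 'sentence' and 'path' columns.
    """
    clips_by_sentence = {}
    with open(tsv_path, newline="", encoding="utf-8") as handle:
        # QUOTE_NONE: treat quote characters inside sentences as plain text.
        reader = csv.DictReader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
        for row in reader:
            sentence = (row.get("sentence") or "").strip()
            if sentence:
                clips_by_sentence.setdefault(sentence, []).append(row.get("path"))
    return clips_by_sentence


def find_pairs(dutch, german, translations):
    """Return (nl_sentence, de_sentence) pairs that are recorded in both datasets.

    'translations' is a hypothetical dict mapping a Dutch sentence to its
    German translation; you have to provide it yourself.
    """
    pairs = []
    for nl_sentence, de_sentence in translations.items():
        if nl_sentence in dutch and de_sentence in german:
            pairs.append((nl_sentence, de_sentence))
    return pairs
```

You would call `load_sentences` once per language (for instance on `nl/validated.tsv` and `de/validated.tsv`, if that is how your download is laid out) and pass both results to `find_pairs` together with your translation table. The exact string match is deliberately simple; differences in punctuation or casing would need some normalisation first.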