Would it be possible to use librevox?


(Rain1) #1

Hello!

Do you think it would be possible to make use of some of the recordings from librevox? https://librivox.org/

Just an idea, I do not know how practical this would be.


(Michael Henretty) #2

Yes we will! Thank you for the suggestion :slight_smile:


(Rain1) #3

cool!

Another possibility is to take the samples from https://en.wiktionary.org/wiki/Wiktionary:Main_Page - they have recorded many many single words. It might be a good little addition to the data set.


(Rain1) #4

This might also be a good source https://tatoeba.org/eng/sentences/show/2544351 apparently they have 6,128,636 sentences in 322 languages


(Michael Henretty) #5

Thanks @rain1! We are definitely working with Tatoeba. That is a great project!