Would it be possible to use LibriVox?

(Rain1) #1


Do you think it would be possible to make use of some of the recordings from LibriVox? https://librivox.org/

Just an idea; I do not know how practical this would be.

(Michael Henretty) #2

Yes we will! Thank you for the suggestion :slight_smile:

(Rain1) #3


Another possibility is to take the samples from Wiktionary: https://en.wiktionary.org/wiki/Wiktionary:Main_Page They have recorded many, many single words. It might be a good little addition to the data set.

(Rain1) #4

This might also be a good source: https://tatoeba.org/eng/sentences/show/2544351 Apparently they have 6,128,636 sentences in 322 languages.

(Michael Henretty) #5

Thanks @rain1! We are definitely working with Tatoeba. That is a great project!