Bulk sentences submission from Wikipedia

Gweltaz_DG · August 1, 2024, 3:31pm

Hi,

I’d like to submit a collection of sentences automatically retrieved and filtered from a Wikipedia dump, for the Breton corpus.
I’m in doubt if Wikipedia licensing is compatible with Common Voice requirements (CC0). Could anyone confirm ?

If Wikipedia is an accepted source, is it enough to put “Wikipedia” in the citation field, or should you give the complete URL to the pages for any given sentences ?

Thanks

Topic		Replies	Views
Use of Wikipedia Sentences Common Voice sentence-collection	1	364	August 5, 2024
[Technical feedback needed] Wikipedia extractor script beta Common Voice sentence-collection , feedback	76	8352	July 1, 2020
Sentence Extraction now automated Common Voice	4	1306	March 19, 2020
[Common Voice] Technical help needed to grow our sentence diversity DeepSpeech	0	933	July 30, 2019
Retrieving Wikipedia content under CC0 licence Common Voice sentence-collection	4	1924	August 9, 2018

Bulk sentences submission from Wikipedia

Related topics