[Technical feedback needed] Wikipedia extractor script beta

mkohler · March 1, 2020, 11:32am

I’m not sure I understand this. How many sentences do you get just by running the export with your current rule set? These regular expressions are part of "Add basque rules and blacklist" by Thadah (Pull Request #95, common-voice/cv-sentence-extractor on GitHub), correct?

What is the error rate without any manual work applied?
