[Technical feedback needed] Wikipedia extractor script beta

I’m not sure I understand this. How many sentences do you get just by running the export with your current rule set? These regular expressions are part of pull request #95 in common-voice/cv-sentence-extractor ("Add basque rules and blacklist" by Thadah), correct?

What is the error rate without any manual work applied?