I recently noticed a huge amount of sentences (>200k) waiting in the review queue, presumably added from a dictionary for Persian.
Although generally not a bad thing, most of the words are obscure/quiet rare and with a lot of spelling mistakes. And the problem gets worse knowing the current existing sentences are quite biased (the amount of colloquial sentences is below five percent compared to (usually old) text-book and written Persian, which differ a lot in form), and this huge review queue makes it impossible to add more diverse sentences on small frequent basis, considering this small contributor base.
This is in the Sentence Collector, right? If so, we can delete those if they do not provide value. Do they all have the same āsourceā displayed? If so, what is it? And are all the āsentencesā from that source to be removed?
Yes they are from the sentence collector. I canāt say that they donāt provide any value, but well at the current rate reviewing them all is almost impossible and the quality is not high enough for bulk submission, so they might be doing more harm than good.
All the sentences I have been reviewing recently mention āself-prepared sentencesā, which I suspect constitute a big portion of those 250k submissions considering they are all dictionary-like entries (but of course I canāt be sure since I cannot see all the sentences).
It would be nice if we could temporarily āholdā these submissions for later reviews and revisions (maybe by simply putting them in a separate directory that are not exported to CV?), but if thatās not possible I think removing them might be the only option at the moment, provided that this source is actually the cause of this huge queue.
So there must be more sources that recently got added. As I do not have access to the database, I canāt say which ones those are though.
It would be nice if we could temporarily āholdā these submissions for later reviews and revisions (maybe by simply putting them in a separate directory that are not exported to CV?)
That is currently not possible. Something like a quarantine might be a good idea for the future (just might), but currently there is no flag for that and Sentence Collector has a single database. Though thatās just a technical limitation, identifying the actual sources and submissions that contain these sentences is way trickier.