With the new version of the Metadata Viewer, I can see the following:
Although a pretty new feature, people started to use the domain information:
Some of the languages have very high value under unvalidated sentences. With small corpora, this can be expected, but for some like Arabic, Persian or Thai, the values are very high. Again, here, we cannot distinguish between invalidated and not yet reviewed thou.
On the global values, the validated Hours percentage keeps dropping… The recordings are there, but lead communities should validate them. I think we need a global event for recording & validation, as voiced in previous months.