I was wondering: How are sentences selected for the sentence-collection?
By this I mean whether it is made sure that, for example, a certain amount of numeric expressions are contained in the collection:
He ate two apples.
There were 23,921 people.
Pi is approximately 3.1415.
and the same for dates, cities and so on.
Or are sentences more or less selected randomly from different corpora?
Would be interesting to know.