Is it possible to “generate” our own sentences and add them to the sentence collector? Such as:
Today I will go to London.
Yesterday Bill went to London.
Alicia and Allen are in Amsterdam.
I’m working on Turkish and the data should contain common person names, city names, country names etc. These will be semi-synthetic but I cannot think of any other method to get the data in. And I promise they will be CC0
If this is possible, are there any guidelines for this? How many times each of these names should be repeated to be enough?