This has been answered in Matrix chat but those fly away, so here it is:
AFAIK the recordings should be min 1.5s, max 14s. This is a global setting for CV and changing it would effect all languages. So the solution is not there but in sentence collector: Language based validator:
- Limit word count (default 14 for English - which will be used for languages which do not have a specific validator).
- Limit character count (Usually 100-110 will work fine but this is language dependent).
You can time some sentences by different people and calculate secs/word secs/char etc to put the validator limits.
I tested it up to 135 chars (with 14 word limit) and some people (who speak with good accent/emphasis, elderly people etc) had problem fitting it in 14 sec.
Limiting sentences like this would drop your text-corpus possibilities, so you may like to pre-edit them to divide sentences by “:” for example. As these sentences should be CC0, you can do anything with them, but they should be correct of course.
Another side-note: Very long recordings are usually not good for many voice-AI models. For example Coqui limits the recording length to 10 seconds by default. So long sentences will be thrown away anyway…