French - Always addresses

meryll.essig · February 19, 2019, 10:36pm

Hi,

All sentences I get in french are addresses, both when speaking and validating. I looked at the french corpus on the repo and I see many other things such as politics speech, text from novels, etc…

So, I wonder how are sentences selected ? It cannot be fully random, except if the server hasn’t updated his pool of sentences for a while, am I correct ?

I think, by the way, it would be less monotonous if we got different kind of sentences to say randomly.

lissyx · February 25, 2019, 9:33am

Strings exposed for recording are those with the least recordings, I guess we pushed too many addresses and thus the dataset is kinda imbalanced now.

laubern · March 8, 2019, 6:11pm

I had the same feeling of doing only stupid addresses for my 2 first days on Common Voice. Then it changed, and now I get strange and funny sentences. Much better ! Does every body get the same “work profile” ?

lissyx · March 8, 2019, 7:57pm

We might benefit from contributions to augment the size of the dataset of sentences to read, that would help mitigate this poor first experience

Topic		Replies	Views
Are sentences for reading choosen really randomly? Common Voice issue , scripted-speech	3	56	March 27, 2026
I'm almost giving up on the project. Feedback from a big contributor (10000 sentences sent, 7000 listened) Common Voice	24	2328	March 15, 2023
We want your feedback: Improving the sentence collection Common Voice sentence-collection , feedback	34	8976	December 17, 2018
Common voice sentences are the opposite of "common" Common Voice participation , sentence-collection , feedback , issue	27	3892	September 7, 2024
I can't speak sentences in portuguese. There is no phrases for the language Common Voice participation , sentence-collection , feedback , issue , dataset	3	1000	August 31, 2023

French - Always addresses

Related topics