I also think there is a danger of underperformance in the trained model for female, child and elderly voices, as they are underrepresented.
The best we can do is to spread the word about Common Voice, hoping more women will be willing to contribute.
I guess we could bet on feminist networks, as they are very active on social media and could take that danger seriously. I mean, there is no doubt feminists would feel concerned if you told them we are going to live in a voice-controlled world where AIs can only understand male voices.
I think there might be a bias in the population that hears about Common Voice, mostly because word of it spreads through IT places like Mycroft or inside Mozilla.
The bias in contributors might come from those biased places (where most people are males aged between 20 and 40).
I would add that, more generally, there is an issue with spreading the word about Common Voice.
For example, there are 67 million people in France but only 1300 contributors to CV, with maybe 200 regular contributors at most.
I think that bias (males 20-40) could be removed if a lot more people heard about CV.
My two cents on how we could get more people to know about Common Voice:
- Share it on social media
- Talk about it with people we know
- Run events
- Alert feminists, hoping they spread the word about the issue
I am sorry if these are obvious suggestions, but I can’t think of any others yet.
PS: I would like Mozilla to know that if you are looking for people to promote Common Voice at events in France, I would be happy to help.