I agree to publish these sentences under the cc-0 license.
https://pastebin.com/jH7edz3m
Thanks for creating this wonderful project, I hope it turns out to be great!
I agree to publish these sentences under the cc-0 license.
https://pastebin.com/jH7edz3m
Thanks for creating this wonderful project, I hope it turns out to be great!
Alright I’ve added your sentences Anoian, Sonny, and Kusaha! Thank you for your help!
Yeah, we definitely want slang, as well as terms like gay, lesbian, and homosexual. More than just style, we need to make sure we cover all the words we can. In terms of sentence, I agree we need more diversity there as well.
For sentence sentence length, I think around 20 syllables or less is a reasonable length.
Thank you!
I did some sentences. I’m not sure if they’re the kind of thing you need or if you’re still needing contributions, but I hope they’re of some use. I agree to releasing them to the public domain with a cc-0 licence
Honestly, I find this a little troubling. A true language model would include all words, offensive or otherwise. As someone who is depending on voice recognition heavily to get work at my normal job done, and interact with friends and family, I find it very irksome that some words are not recognized as well. I feel censored. How would you feel if your keyboard refused to produce a word that you typed?
There certainly should be room for having offensive words in this language model set.
Wrote some sentences here and tried to include more slangs and female pronouns: https://pastebin.com/94aLhZxT
Here are some short common sentences. They each reside in multiple books on Project Gutenberg from a random collection of works I downloaded from different authors. https://pastebin.com/XRpZgdbw
Would also help for sentiment analysis.
Hi, @mhenretty
Because of @fred_trotter 's comment, I created some health-related sentences. https://pastebin.com/6Y797eNp. Let me know if you need more contributions.
I’ve created some sentences here - maybe not simple enough? Let me know, I’d be happy to supply more
I created an extra thread to discuss this topic:
Hi! Today I explored some interesting cases in terms of pronunciation. I believe these: https://pastebin.com/bTKWpHiU
can help cover a few potential bugs of machine understanding.
Hi, @bavencope! My name is Janet, and I am a volunteer for Project Common Voice. Thank you so much for your contribution. I’d like to let you know that your sentences have been added.
Hi, @pro.gadget! My name is Janet, and I am a volunteer for Project Common Voice. Thank you so much for your contribution. I’d like to let you know that your sentences have been added. Cheers!
Hi, @tlcoles! My name is Janet, and I am a volunteer for Project Common Voice. Thank you so much for your contribution. I’d like to let you know that your sentences have been added. Also, I think it would be awesome to have sentence that touch on women’s health, but that’s just my two cents. Cheers!
Hi, @RLarissa! My name is Janet, and I am a volunteer for Project Common Voice. Thank you so much for your contribution. I’d like to let you know that your sentences have been added.
Hello, @Kieran_Drew! Thank you so much for your contribution. Just letting you know that your sentences have been added.
https://github.com/mozilla/voice-web/commit/86f5769fad99d9bb844fb2eb61e4211413856ca9