[Help Wanted] Write some nice, short sentences for people to read


(Tammi L. Coles) #17

Hi, @mhenretty

Because of @fred_trotter 's comment, I created some health-related sentences. https://pastebin.com/6Y797eNp. Let me know if you need more contributions.


(Mark) #18

I’ve created some sentences here - maybe not simple enough? Let me know, I’d be happy to supply more

https://pastebin.com/3wcQEnB8


(Kieran Drew) #19

Hopefully this helps.
https://pastebin.com/1VvGnUiV


#20

I created an extra thread to discuss this topic:


Prompt design
Submitting text to be voiced and collecting submitted audio data?
(Лариса) #21

Hi! Today I explored some interesting cases in terms of pronunciation. I believe these: https://pastebin.com/bTKWpHiU
can help cover a few potential bugs of machine understanding.


(Janet Shih) #22

Hi, @bavencope! My name is Janet, and I am a volunteer for Project Common Voice. Thank you so much for your contribution. I’d like to let you know that your sentences have been added.


(Janet Shih) #23

Hi, @pro.gadget! My name is Janet, and I am a volunteer for Project Common Voice. Thank you so much for your contribution. I’d like to let you know that your sentences have been added. Cheers!


(Janet Shih) #24

Hi, @tlcoles! My name is Janet, and I am a volunteer for Project Common Voice. Thank you so much for your contribution. I’d like to let you know that your sentences have been added. Also, I think it would be awesome to have sentence that touch on women’s health, but that’s just my two cents. :slight_smile: Cheers!


(Janet Shih) #25

Hi, @RLarissa! My name is Janet, and I am a volunteer for Project Common Voice. Thank you so much for your contribution. I’d like to let you know that your sentences have been added.


(Janet Shih) #26

Hello, @Kieran_Drew! Thank you so much for your contribution. Just letting you know that your sentences have been added. :slight_smile:

https://github.com/mozilla/voice-web/commit/86f5769fad99d9bb844fb2eb61e4211413856ca9


(Janet Shih) #27

@mlennox Thank you so much for your contribution! Your sentences have been added. :slight_smile:


(James Fortune) #28

Here are some sentences, not sure if this is the right place to send them:
https://pastebin.com/raw/RJp7bWpu


(Mike Sheldon) #29

Here’s 228 sentences of CC0 licensed dialog:

https://pastebin.com/raw/Jb8grcpV

Hope that helps :slight_smile:


(Mike Sheldon) #30

And here’s 200 sentences of CC0 conversational sentences:

https://pastebin.com/raw/EawMHjEx


(James Fortune) #31

I got 9.8 MB more of CC0 conversation sentences but can anyone help me filter them to remove the incorrect ones and verify them?


Prompt design
(milde) #32

@James_Fortune cool, that’s a substantial amount of utterances! Since most of them are very technical, I’ve filtered the worst offenders automatically:

http://speech.tools/cc0-75k-conversations.filtered

But that’s still 75k sentences, a lot more than the the 7k in the common voice v1 release. Not a bad idea to exclude some more utterances manually, I guess, but it’s a start.


(James Fortune) #33

@bmilde I archived your link with the wayback machine: https://web.archive.org/web/20180116003608/http://speech.tools/cc0-75k-conversations.filtered

I’ll see if I can get an other substantial amount of utterances for Common Voice.


(Janet Shih) #34

Hello, @Elleo! Thank you so much for your contribution. Just letting you know that your sentences (both batches) have been added. :slight_smile:


(Erik) #35

I agree to publish these sentences under the CC-0 license.

https://pastebin.com/raw/MSzCdhqN


(Rubén Martín [Away till Aug. 20th]) #36

To everyone who has participated in this topic, I want to point that we are asking for your feedback on the sentence collection process, thanks!