Single-word utterances better than sentences?

johnycage · August 28, 2020, 6:12am

I believe labeled single-word audio recordings work better for speech recognition than datasets of multi-word clips.
Is there any research or evidence supporting this theory?
Why does the Common Voice project have only entire sentences and no single-word recordings?
Do you think single-word clips should be added, are equally important, and would improve the overall CV dataset?

othiele · August 28, 2020, 8:15am

I disagree. In my view, your input material should be about the same as what you want to recognize. If you want to recognize just single words, you could use such input. There is a Common Voice dataset of single numbers and letters.

But I am happy to be proven wrong. Just because my approach works doesn’t mean there isn’t a better one.
