Single speaker training

jcr678 · June 29, 2020, 3:55pm

I want to give classify speech commands for a single speaker. The speaker could have an accent, could not have an accent but it would always be in the English language. The dataset would be extremely small, (10 speech commands, 10 .wav recordings per command). Would mozilla deepspeech work for this, perhaps a fine tuning model? How would I go about this?

othiele · June 29, 2020, 5:01pm

If it is just 10 commands, simply try a custom language model. Ideally they sound differently. Search for that here in the forum, you’ll find some ideas on what to do.

Topic		Replies	Views
Train for only one voice DeepSpeech learning	4	1177	March 26, 2019
Automatic Speech Recognition for any speaker DeepSpeech	1	456	April 23, 2021
Fine Tuning with limited data - Questions on Fine Tuning in General DeepSpeech learning	3	2594	September 24, 2020
Training for a specific purpose with specific vocabulary DeepSpeech	4	399	June 12, 2020
4 Speaker Dataset - Training a Context Based Speech Recognition Model DeepSpeech	0	541	March 23, 2019

Single speaker training

Related topics