Fine-tuning DeepSpeech Model (CommonVoice-DATA)

nmstoker · August 20, 2019, 10:36am

I see - I hadn’t picked that up from your initial question.

0.4.1 did include a snapshot from English Common Voice. There was some discussion on this [here] (Any reason 0.5.x models weren't trained on Common Voice data this time?) : why 0.5 didn’t include Common Voice (was an oversight) and that it’s likely to be in the released models for 0.6 once it’s out of alpha.

If you’re trying to improve it specifically for Indian English some fine tuning might help (I know others on here have been looking at that for Indian accents but I don’t know how they’ve got on). Another approach to consider would be including Indian sourced text in the LM, since that could help it cope with “Indianisms” that aren’t typically part of the American / British English data that likely make up the bulk of the LM data.

Topic		Replies	Views
Pre-trained model become worse when i trained common voice data DeepSpeech	15	1795	September 21, 2019
Fine Tuning with limited data - Questions on Fine Tuning in General DeepSpeech learning	3	2585	September 24, 2020
How to trained a model for common voice dataset using deepspeech v0.6.1? DeepSpeech	23	2711	March 16, 2020
Fine Tuning with Custom English Data(Very Small Size) DeepSpeech	1	377	April 5, 2021
Question with DeepSpeech Transfer Learning DeepSpeech	40	2832	March 28, 2020

Fine-tuning DeepSpeech Model (CommonVoice-DATA)

Related topics