Hello Team! I am training a German speech model using DeepSpeech and struggling to find a good set of hyperparameters. I followed the hyperparameters documented in the DeepSpeech releases, but the model is not producing great results (WER around 30%). When I increase the number of epochs beyond 10, the model overfits, i.e. training loss drops below 5 while validation loss climbs above 100. Is there a way to tune the hyperparameters in DeepSpeech, using grid search or some other method?
I am using approximately 300 hours of data, with the following parameters:
Independent of hyperparameters, 300 hours alone is not sufficient to train a model capable of understanding unrestricted speech. For English we use more than 10 times that amount of speech and that’s still not enough.
That said, if 300 hours is all you have, I'd try fine-tuning the 0.4.1 English model with your German data, using the standard character substitutions ä=>ae, ü=>ue, ö=>oe, and ß=>ss to account for the fact that the alphabets differ.
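As a rough sketch of that substitution on the transcript side, assuming your training CSVs use the usual wav_filename, wav_filesize, transcript columns (the file names below are just placeholders):

```python
import csv

# Map German-specific characters onto the English alphabet used by the
# released 0.4.1 checkpoint (same substitutions as suggested above).
SUBSTITUTIONS = {"ä": "ae", "ö": "oe", "ü": "ue", "ß": "ss"}

def normalize(text):
    """Lower-case a transcript and replace characters missing from the English alphabet."""
    text = text.lower()
    for src, dst in SUBSTITUTIONS.items():
        text = text.replace(src, dst)
    return text

def convert_csv(in_path, out_path):
    """Rewrite a DeepSpeech training CSV with alphabet-compatible transcripts."""
    with open(in_path, newline="", encoding="utf-8") as fin, \
         open(out_path, "w", newline="", encoding="utf-8") as fout:
        reader = csv.DictReader(fin)
        writer = csv.DictWriter(fout, fieldnames=reader.fieldnames)
        writer.writeheader()
        for row in reader:
            row["transcript"] = normalize(row["transcript"])
            writer.writerow(row)

if __name__ == "__main__":
    # Placeholder file names; point these at your own CSVs.
    convert_csv("de_train.csv", "de_train_ascii.csv")
```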
As for the hyperparameters, start by trying this fine-tuning with the ones you are already using.
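DeepSpeech does not ship a built-in grid search, but since training is driven entirely by DeepSpeech.py flags, you can script a small sweep yourself. Below is a rough sketch assuming 0.4.1-style flag names (--epoch, --learning_rate, --dropout_rate, --checkpoint_dir) and placeholder CSV paths; verify the exact flag names against the release you are running, since they change between versions.

```python
import itertools
import subprocess

# Hyperparameter grid to sweep; these values are only examples.
GRID = {
    "learning_rate": [0.0001, 0.00005, 0.00001],
    "dropout_rate": [0.15, 0.25, 0.35],
}

# Flag names follow the 0.4.1 release; double-check them for your version.
BASE_ARGS = [
    "python", "DeepSpeech.py",
    "--train_files", "de_train_ascii.csv",   # placeholder paths
    "--dev_files", "de_dev_ascii.csv",
    "--test_files", "de_test_ascii.csv",
    "--epoch", "10",
]

for lr, dropout in itertools.product(GRID["learning_rate"], GRID["dropout_rate"]):
    # Give each run its own checkpoint directory so results do not overwrite each other.
    ckpt_dir = "checkpoints/lr{}_do{}".format(lr, dropout)
    cmd = BASE_ARGS + [
        "--learning_rate", str(lr),
        "--dropout_rate", str(dropout),
        "--checkpoint_dir", ckpt_dir,
    ]
    print("Running:", " ".join(cmd))
    subprocess.run(cmd, check=True)
```

Since each run writes to its own checkpoint directory, you can compare the dev/test loss DeepSpeech.py reports per run and keep the best combination.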
lissyx:
@agarwalaashish20 To complement that answer, I can confirm that with a somewhat lower amount of audio, around 235 hours of French including the released Common Voice data, and fine-tuning on top of English with a compatible alphabet, actual field usage under proper conditions (speaking slowly, articulating, etc.) gives decent enough results, even though the WER/CER on the test set are not really awesome.
lissyx: