Hello @caucheteux
Here are some insights from my Spanish model that may be useful for you.
FYI, here’s the branch used for the tests; it’s just a few days behind the current master of DeepSpeech: https://github.com/carlfm01/DeepSpeech/tree/layers-testing
To use this branch, you will need to pass the following params (see the example invocation after this list):

- `--fine_tune` Whether or not to fine-tune the layers transferred from the source model
- `--drop_source_layers` A single integer for how many layers to drop from the source model (to drop just the output layer == 1, drop the penultimate and output layers == 2, etc.)
- `--source_model_checkpoint_dir` The path to the trained source model; it loads all the layers and then drops the specified ones
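To give you an idea, here’s roughly how I launch a run on that branch. Only `--fine_tune`, `--drop_source_layers` and `--source_model_checkpoint_dir` come from the branch; the rest are the usual DeepSpeech flags, and the paths and exact flag names are just placeholders from my setup, they can differ depending on your DeepSpeech version:

```bash
# Sketch of a transfer-learning run on the layers-testing branch:
# load the source (English) checkpoint, drop only the output layer
# (--drop_source_layers 1) and fine-tune the remaining layers (--fine_tune).
python3 DeepSpeech.py \
  --train_files es/train.csv \
  --dev_files es/dev.csv \
  --test_files es/test.csv \
  --alphabet_config_file data/alphabet-es.txt \
  --checkpoint_dir checkpoints/es-transfer/ \
  --source_model_checkpoint_dir checkpoints/english-source/ \
  --drop_source_layers 1 \
  --fine_tune=True \
  --learning_rate 0.00012 \
  --dropout_rate 0.24 \
  --train_batch_size 12 \
  --epoch 1
```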
Things you can’t do with the current branch: fine-tune specific layers, drop specific layers, or freeze specific layers.
For the following results I only dropped the last layer and fine-tuned the others.
Total hours | LR | Dropout | Epochs | Mode | Batch Size | Test set | WER |
---|---|---|---|---|---|---|---|
500 | 0.00012 | 0.24 | 1 | Transfer Learning | 12 | Train-es-common voice | 27% |
500 | 0.000001 | 0.11 | 2 | Transfer Learning | 10 | Train-es-common voice | 46% |
500 | 0.0001 | 0.22 | 6 | From scratch | 24 | Train-es-common voice | 50% |
For 500 hours of data, one epoch seems to be enough when dropping the last layer and fine-tuning the other ones.
As @lissyx mentioned, I think your way to go is to just fine-tune the existing model with your data, using a very low learning rate like “0.000001”.
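For that plain fine-tuning case, something like this on stock DeepSpeech should do it (again, just a sketch; paths are placeholders and the standard flag names may differ between versions). You point `--checkpoint_dir` at the downloaded English checkpoint and training simply resumes from it:

```bash
# Sketch: continue training the released English checkpoint on your own data
# with a very low learning rate so the pretrained weights move only slightly.
python3 DeepSpeech.py \
  --train_files my-data/train.csv \
  --dev_files my-data/dev.csv \
  --test_files my-data/test.csv \
  --checkpoint_dir deepspeech-english-checkpoint/ \
  --learning_rate 0.000001 \
  --epoch 2
```

Note this only works if you keep the same alphabet as the released model.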
The transfer learning approach, I feel, is more about solving the issue of different alphabets.