Transfer learning or fine-tuning with UTF-8 bytes

I have a bilingual dataset of less than 20 hours of audio, so I will have to start from the pre-trained DeepSpeech model.

I want to use the UTF-8 bytes output mode to do either transfer learning or fine-tuning with my dataset, but I don't have a pre-trained model that was trained in UTF-8 bytes mode. The language itself shouldn't matter, because UTF-8 bytes can represent any text.
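To illustrate why I think the language doesn't matter: in bytes mode the targets are just the UTF-8 encoding of the transcript, so a single output layer of 256 labels covers any script. A minimal sketch of that idea (illustration only, not DeepSpeech code):

```python
def transcript_to_byte_labels(text):
    """Encode a transcript as UTF-8 byte labels (integers 0-255)."""
    return list(text.encode("utf-8"))

# Works identically for Latin, Devanagari, mixed text, etc.
english = transcript_to_byte_labels("hello")     # one byte per ASCII char
hindi = transcript_to_byte_labels("नमस्ते")        # multiple bytes per char

print(english)
print(all(0 <= b <= 255 for b in hindi))  # every label fits in 0..255
```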

Can I drop the last layer of the pre-trained DeepSpeech model and continue training on my dataset with a new output layer of 256 labels, one per UTF-8 byte value?
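Concretely, this is the kind of invocation I have in mind, assuming the transfer-learning flags in the DeepSpeech training code (`--drop_source_layers` to discard the source model's output layer, `--bytes_output_mode` to switch to 256 byte labels); all paths and hyperparameters below are placeholders, not tested values:

```shell
# Sketch only: drop the English-alphabet output layer from the released
# checkpoint, then continue training with 256 UTF-8 byte labels.
python3 DeepSpeech.py \
  --drop_source_layers 1 \
  --bytes_output_mode \
  --load_checkpoint_dir ./deepspeech-0.9.3-checkpoint \
  --save_checkpoint_dir ./bytes-transfer-checkpoint \
  --train_files train.csv \
  --dev_files dev.csv \
  --test_files test.csv \
  --epochs 30 \
  --learning_rate 0.0001
```

My understanding is that the output-layer shapes won't match anyway (29 alphabet labels vs. 256 byte labels), which is exactly why that layer has to be dropped and re-initialized. Please correct me if those flags don't combine this way.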

Is there any other way I can do both: a) use the pre-trained DeepSpeech model, and b) train with UTF-8 bytes?