How hard is it to train a new model?

Web_Dever · February 8, 2020, 8:03pm

Hi guys, just a very newbie question.

I am a mainstream developer (full stack) and have some ideas to develop an application. I have no previous knowledge of voice-recognition algorithms and only have basics of AI/ML.

My native language is Portuguese. I wanted to build an application for my language. The thing is that I’m from Portugal, and the Portuguese available on Mozilla Deepspeech is Brazillian. European and Brazillian Portuguese are extremely different phonetically.

Because of this, I would have to start building a language model from ground zero.

My question is, realistically, how hard would it be for me to develop such a model using Deepspeech? I’ve searched everywhere for a question like this but couldn’t find anything about it.

Thank you in advance guys.

dabinat · February 10, 2020, 1:29am

In terms of actual difficulty, not much. But there are scale barriers - you would need at least a few thousand hours of transcribed Portuguese speech and multiple graphics cards for training in a reasonable amount of time (or rent a multi-GPU server by the hour on AWS).

Web_Dever · February 10, 2020, 6:29am

I see, thank you for the info!

lissyx · February 10, 2020, 9:26am

@reuben might be able to help there

Topic		Replies	Views
Tutorial: Training a Dutch model DeepSpeech learning	6	2992	July 9, 2020
Share your trained model for Mozilla DeepSpeech? DeepSpeech	6	465	April 14, 2020
How much data for deepspeech for japanese english accent DeepSpeech	0	300	March 30, 2021
Training a custom model DeepSpeech	8	1540	August 10, 2020
I'm looking for an Esperanto Pretained model DeepSpeech	3	241	March 21, 2021

How hard is it to train a new model?

Related topics