The problem of training the Chinese Model

jackhuang · January 30, 2018, 6:44am

I trained the model using a Chinese speech data set. Training doesn't take long (less than 3 hours). However, testing and exporting the model takes a very long time (more than 3 days). Is this caused by the large alphabet size of Chinese, or by other factors?

Branden_Stark · January 31, 2018, 2:44am

I have run into this problem too. I think it may be caused by the large alphabet size and the long audio clips.

jackhuang · January 31, 2018, 2:59am

Are you also training the Chinese model?

Branden_Stark · January 31, 2018, 5:24am

Yes, I trained a Chinese model, using Hanzi as the alphabet.

jackhuang · January 31, 2018, 12:06pm

Do you think it is only the testing process that takes such a long time?

Branden_Stark · February 1, 2018, 7:01am

The training process doesn't seem to take much time.

kenyeung128 · February 26, 2018, 11:09am

The loss seems high. Has it converged?

jackhuang · February 26, 2018, 11:28am

It seems that the loss converges at about 230. That is too high, and the results are very bad: no matter what audio is given, the model outputs the same very short result, such as “额”.

lissyx · February 26, 2018, 11:32am

@jackhuang Just a reminder: please paste this kind of output as text. Images are not indexed, take longer to load, and are hard to search.

lissyx · February 26, 2018, 11:33am

Can you document your training parameters? Amount of audio, width of the network, etc.? We have no feedback on languages like Chinese, so you will likely have to proceed by trial and error.

kenyeung128 · February 26, 2018, 2:10pm

Yes, it's quite big. Have you tried training with a batch of short utterances first? Also, how many characters are in the alphabet?
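
Both checks can be done with a few lines of Python. The following is only a minimal sketch: the function names and the sample `(wav_path, transcript)` pairs are hypothetical and not part of DeepSpeech, and transcript length is used as a rough stand-in for audio duration.

```python
from collections import Counter


def alphabet_from_transcripts(transcripts):
    """Count how often each character appears across all transcripts."""
    counts = Counter()
    for text in transcripts:
        # Ignore whitespace so only real symbols are counted.
        counts.update(ch for ch in text if not ch.isspace())
    return counts


def shortest_first(samples, limit):
    """Return the `limit` samples with the shortest transcripts."""
    return sorted(samples, key=lambda s: len(s[1]))[:limit]


# Hypothetical (wav_path, transcript) pairs; replace with your own data.
samples = [
    ("a.wav", "你好"),
    ("b.wav", "今天天气很好"),
    ("c.wav", "好"),
]

counts = alphabet_from_transcripts(t for _, t in samples)
print("distinct characters:", len(counts))

# A small subset of short utterances for a first training run.
subset = shortest_first(samples, 2)
print([path for path, _ in subset])
```

The number of distinct characters is the alphabet size the output layer has to cover, and characters that occur only a handful of times are worth looking at, since the model sees very few examples of them.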
