Trainig model loss

no, 93 phrase lasts 10 minutes)

You can’t expect to train anything with just 10 minutes. Again, what use-case do you target ?

Im uzbek, and there arent any fine uzbek SST, im goin to develop it) i mean my company, my city can use it after developing it. no commercial user for now

You need more than 10 minutes for generic-purpose STT. Are you contributing to Common Voice for Uzbek? That’s the best course of action at the moment.

Once you get a few dozen of hours, you can try and start building something with transfer-learning.

Ok thank you brother, i will try

1 Like

one question, can i repeate phrases ? for example above 93 phrases are only 13 different phrases. I just have repeated them. For example i have this phrase:
“Plastik yoqolsa bankka borib ochtiring”. This phrase repeated 7 times. and also 7 different audio with different tempo

We don’t have enough feedback yet on the behavior of DeepSpeech on that, it might depend on your dataset as well as your language. Repeated sentences (i.e. same sentence spoken by different people, so close to your case) seems to improve a bit the model. But we are far from having a definitive answer on that, so I would suggest to check that cautiously.

@Akmal_Nodirov A good way to start is to retrain something that is well known. Have you seen this video?

And for the number of samples. Typically, you will need about 10.000 samples of length 4-8 seconds to get somewhat good result for general language understanding.

Maybe you can get them here?