Train model but actual prediction is too poor

Sudarshan.gurav14 · February 22, 2020, 12:59pm

@victornoriega7 I have around 2600 samples

and Now i am used Deepspeech v0.6.1

is Above command right or wrong ? for train model using kenLM ?

I know dataset is low but i want recognize few commands?

elpimous_robot · February 23, 2020, 9:15am

2600 samples are very very poor…
How many sentences in your vocabulary ?

Sudarshan.gurav14 · February 23, 2020, 9:51am

@elpimous_robot my vocabulary is around 200 sentences.

my sentence length is around 8 words

elpimous_robot · February 23, 2020, 10:22am

How many different voices ?.

Sudarshan.gurav14 · February 23, 2020, 10:37am

@elpimous_robot
6 different voices contain
3 male and 3 female

elpimous_robot · February 23, 2020, 11:19am

Arghhhh…

My friend, it’s totally impossible !!!

Ex:
2600 samples of only 1 speaker,
for 200 sentences,
with nearly 500 different words
with an alphabet of 26 to 30 characters…
You could reach a 40 to 60% accuracy max.

For 200 different sentences, for 4 peoples, it’s impossible without a bigger model.

It it was my problem, i’d work with at least
10000 train for each person, and separate it in 70/20/10% train/dev/test.

Sorry for the bad news.

Sudarshan.gurav14 · February 23, 2020, 11:59am

@elpimous_robot It’s ok.

Can I increase a dataset using audio augmentation ?

Its help me or not? actually I am new in that so …?

elpimous_robot · February 23, 2020, 12:38pm

Yes if course it will help, but after basis…
It will not help here for now.
Not enough datas.

Augmentation is helpfull to add noise, echos, durations, and tone.
But the most important part is good initial datas, and ENOUGH

Sudarshan.gurav14 · February 24, 2020, 8:09am

@elpimous_robot thanks friend

@lissyx and @elpimous_robot
Inference taking long time any idea to fast that process?
eg.

   Loaded model in 0.0259s.
   Loading language model from files KenLM-model/trie
   Loaded language model in 0.00017s.
   Running inference.
   hi how are you 
   Inference took 3.328s for 5.952s audio file.

is any solution please help me?

lissyx · February 24, 2020, 10:06am

Hey, you can’t just ask people random questions without context. 3.3s for 5.9s is quite fast.

Sudarshan.gurav14 · February 24, 2020, 10:27am

@lissyx for help thanks

lissyx · February 24, 2020, 10:29am

@Sudarshan.gurav14 I won’t answer since you don’t care about my answers to read them?

Sudarshan.gurav14 · February 24, 2020, 10:36am

@lissyx apologize.

really i was not having the idea but still i got the answer which i was expecting that’s why i said thank you

next time i will make sure to give the proper context before asking.

lissyx · February 24, 2020, 10:39am

Please tell me what is difficult to understand in asking “it’s not fast enough, what can I do” without even telling what is your hardware, what are your constraints.

victornoriega7 · February 24, 2020, 2:38pm

If you’re using 0.6.1 you should also update your trie and lm.binary. Are you sure that you’re generating your LM from the file with all of your possible commands?

Can I increase a dataset using audio augmentation ?

No. From the source code, I infered that audio augmentation don’t create new files, just transform the current audio into something noisy. This is to create a trained model more robust for noisy tests and that can generalize well.
In your case, you don’t need that good generalization, because you already know that only a few persons will be using it.

Try getting more data like the french robot topic.

elpimous_robot · February 25, 2020, 7:50pm

Yes, and I could say : after more more more datas…,
use python, or a terminal command to duplicate all your datas, and process audio transformations, to slighty change audio specs.
You’ll have 2x more datas…
The more datas, the better your accuracy.
Note : pay attention in data augmentation values !! use small changes, or you’ll train bad audio files, and your accuracy will not increase.

elpimous_robot · February 25, 2020, 7:54pm

@Sudarshan.gurav14,
Friend, deep learning learns us patience !
You need to do like all of us, progress slowly, read, read…read, test your own ideas.
And magic will appear !

Sudarshan.gurav14 · February 25, 2020, 7:58pm

I was change the recording speed like slow , fast

Sudarshan.gurav14 · February 25, 2020, 8:02pm

@elpimous_robot Yes, Right

i am reading and understand concepts

Thanks

Sudarshan.gurav14 · March 3, 2020, 5:57am

Hi @elpimous_robot

I want to change gain of audio using voice corpus tool as you suggest
how much gain i change now i am change my gain 0.5 is it ok ?

there is one more arg -times i did’t no how to use can please help me?

Now, i am decrease my commands i just want 70 command out of 200

One more que:
suppose i have 1 wav can i change its gain 2 time mean

1.wav [ original]
1_gain_05.wav [same file]
1_gain_07.wav [same file]

is it ok?

Topic		Replies	Views
Trainig model loss DeepSpeech	27	1144	March 13, 2020
Evaluation of the model DeepSpeech	13	797	December 11, 2019
Problem : converging to a wrong model DeepSpeech	3	785	November 29, 2018
Getting very poor predictions from a trained model - 0.7 version over the existing checkpoint DeepSpeech	3	385	May 23, 2020
Question about lack of training data DeepSpeech	5	363	February 18, 2020

Train model but actual prediction is too poor

Related topics