@victornoriega7 I have around 2600 samples,
and I am now using DeepSpeech v0.6.1.
Is the above command right or wrong for training a model with KenLM?
I know the dataset is small, but I only want to recognize a few commands.
2600 samples is very, very poor…
How many sentences are in your vocabulary?
How many different voices?
Arghhhh…
My friend, it's totally impossible!!!
Ex:
2600 samples of only 1 speaker,
for 200 sentences,
with nearly 500 different words
with an alphabet of 26 to 30 characters…
You could reach 40 to 60% accuracy at most.
For 200 different sentences and 4 people, it's impossible without a bigger model.
If it were my problem, I'd work with at least
10000 training samples for each person, split 70/20/10% into train/dev/test.
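As an illustration of the 70/20/10 split suggested above, here is a minimal Python sketch. The function name `split_dataset` and the file names are made up for this example and are not part of DeepSpeech; with several speakers you would probably run it once per speaker so that each voice appears in all three sets.

```python
import random

def split_dataset(samples, train_pct=70, dev_pct=20, seed=0):
    """Shuffle the samples and split them into train/dev/test lists.

    Whatever is left after train and dev goes to test (10% by default).
    """
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)  # fixed seed keeps the split reproducible
    n_train = len(shuffled) * train_pct // 100
    n_dev = len(shuffled) * dev_pct // 100
    train = shuffled[:n_train]
    dev = shuffled[n_train:n_train + n_dev]
    test = shuffled[n_train + n_dev:]
    return train, dev, test

# Hypothetical file names, only for illustration.
files = [f"speaker1_{i:05d}.wav" for i in range(10000)]
train_set, dev_set, test_set = split_dataset(files)
print(len(train_set), len(dev_set), len(test_set))
```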
Sorry for the bad news.
@elpimous_robot It's ok.
Can I increase the dataset using audio augmentation?
Will it help me or not? Actually, I am new to this.
Yes, of course it will help, but only after the basics…
It will not help here for now.
Not enough data.
Augmentation is helpful for adding noise, echoes, duration changes, and tone changes.
But the most important part is good initial data, and ENOUGH of it.
@elpimous_robot thanks friend
@lissyx and @elpimous_robot
Inference is taking a long time. Any idea how to speed it up?
eg.
Loaded model in 0.0259s.
Loading language model from files KenLM-model/trie
Loaded language model in 0.00017s.
Running inference.
hi how are you
Inference took 3.328s for 5.952s audio file.
Is there any solution? Please help me.
Hey, you can't just ask people random questions without context. 3.3s for 5.9s is quite fast.
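To put a number on "quite fast": one common way to read those timings is the real-time factor, i.e. inference time divided by audio duration, where anything below 1 means faster than real time. A small sketch using the figures from the log above (the helper name `real_time_factor` is just for this example):

```python
def real_time_factor(inference_seconds, audio_seconds):
    """Return inference time divided by audio duration (below 1.0 is faster than real time)."""
    return inference_seconds / audio_seconds

# Values taken from the log line "Inference took 3.328s for 5.952s audio file."
rtf = real_time_factor(3.328, 5.952)
print(f"RTF = {rtf:.2f}")
print("faster than real time" if rtf < 1.0 else "slower than real time")
```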
@lissyx I apologize.
I really did not have any idea, but I still got the answer I was expecting, that's why I said thank you.
Next time I will make sure to give proper context before asking.
Please tell me what is so hard to understand: you are asking "it's not fast enough, what can I do" without even telling us what your hardware is or what your constraints are.
If you're using 0.6.1 you should also update your trie and lm.binary. Are you sure that you're generating your LM from the file with all of your possible commands?
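A quick way to check that last point is to build the LM input text directly from your command list. The sketch below is only an assumption about how that file could be prepared: the allowed character set is a guess (compare it with your own alphabet.txt), and the helper names are invented for this example. The resulting text, one command per line, is what you would then hand to KenLM before regenerating lm.binary and the trie.

```python
import re

# Assumed alphabet: lowercase a-z, apostrophe and space. Check your alphabet.txt.
ALLOWED = set("abcdefghijklmnopqrstuvwxyz' ")

def normalize(command):
    """Lowercase a command, collapse whitespace, and reject unexpected characters."""
    text = re.sub(r"\s+", " ", command.lower().strip())
    bad = set(text) - ALLOWED
    if bad:
        raise ValueError(f"characters not in alphabet: {sorted(bad)} in {command!r}")
    return text

def build_vocabulary(commands):
    """Return the sorted, de-duplicated list of normalized commands."""
    return sorted({normalize(c) for c in commands if c.strip()})

# Hypothetical commands, only for illustration.
commands = ["Hi how are you", "turn on  the light", "hi how are you"]
vocabulary_text = "\n".join(build_vocabulary(commands))
print(vocabulary_text)
```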
Can I increase a dataset using audio augmentation?
No. From the source code, I inferred that audio augmentation doesn't create new files; it just transforms the current audio into something noisier. The aim is a trained model that is more robust to noisy test audio and generalizes well.
In your case, you don't need that much generalization, because you already know that only a few people will be using it.
Try getting more data, as in the French robot topic.
Yes, and I could say: after more, more, more data…,
use Python or a terminal command to duplicate all your data and apply audio transformations to slightly change the audio characteristics.
You'll have 2x more data…
The more data, the better your accuracy.
Note: pay attention to the data augmentation values!! Use small changes, or you'll train on bad audio files, and your accuracy will not increase.
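As a rough idea of the "duplicate and slightly transform" step in Python, here is a sketch that writes a copy of a WAV file with its volume scaled by a gain factor. It uses only the standard library and assumes 16-bit PCM input; the function names are invented for this example, and this is not how the voice corpus tool works internally.

```python
import array
import sys
import wave

def apply_gain(samples, gain):
    """Scale 16-bit PCM samples by gain, clipping to the valid int16 range."""
    return array.array(
        "h", (max(-32768, min(32767, round(s * gain))) for s in samples)
    )

def augment_wav(src_path, dst_path, gain):
    """Write a copy of a 16-bit PCM WAV file with its volume scaled by gain."""
    with wave.open(src_path, "rb") as src:
        params = src.getparams()
        if params.sampwidth != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        samples = array.array("h")
        samples.frombytes(src.readframes(params.nframes))
    if sys.byteorder == "big":
        samples.byteswap()  # WAV data is little-endian
    scaled = apply_gain(samples, gain)
    if sys.byteorder == "big":
        scaled.byteswap()
    with wave.open(dst_path, "wb") as dst:
        dst.setparams(params)
        dst.writeframes(scaled.tobytes())
```

For example, `augment_wav("1.wav", "1_gain_09.wav", 0.9)` would write a slightly quieter copy (file names hypothetical). Each new file would also need its own row in the training CSV, with the same transcript as the original.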
@Sudarshan.gurav14,
Friend, deep learning teaches us patience!
You need to do as all of us do: progress slowly, read, read… read, and test your own ideas.
And magic will appear!
I changed the recording speed (slower and faster).
I want to change the gain of the audio using the voice corpus tool, as you suggested.
How much gain should I apply? Right now I am changing the gain by 0.5, is that ok?
There is one more argument, -times, that I don't know how to use. Can you please help me?
I have now reduced my commands: I just want 70 commands out of 200.
One more question:
Suppose I have 1 wav file. Can I change its gain 2 times, meaning:
1.wav [original]
1_gain_05.wav [same file]
1_gain_07.wav [same file]
Is that ok?