dara1400
(Dara1400)
June 11, 2019, 7:18am
1
Hi
Thanks Mozilla for its wonderful DeepSpeech project,
I have some problems with accuracy,
In windows speech recognition library you can limit the vocab to only focus on some sentences or words. so the result will be so accurate.
Is there any option for this in Mozilla Deep Speech?
4 Likes
kdavis
(kdavis)
June 11, 2019, 7:55am
2
You can train your own language model on only the desired sentences. See here for how we trained ours on a larger set of text.
1 Like
dara1400
(Dara1400)
June 11, 2019, 8:08am
3
thanks for your answer
I didn’t get what i shloud do?
Is there any tutorial or video (explaining step by step)?
1 Like
dara1400
(Dara1400)
June 12, 2019, 7:19am
6
how can i find generate_trie?
yv001
(Yv)
June 12, 2019, 7:26am
7
From native client in releases . Choose one for your system, e.g. for linux, latest native client 0.5.0 is here
1 Like
dara1400
(Dara1400)
June 12, 2019, 8:00am
8
for words that deep speech predicting can i have collection of alternatives ?
nmstoker
(Neil Stoker)
June 12, 2019, 12:07pm
9
Yes, although it’s worth experimenting to see the impact.
Might be worth reading a bit about Language Models in general so you have an idea of what they’re for and why they’re being used here.
1 Like
dara1400
(Dara1400)
June 12, 2019, 11:31pm
10
I cant find any example of that, Would you send me an example of getting collection of alternative words from MozillaDeepSpeech?
dara1400
(Dara1400)
June 13, 2019, 5:13am
11
thanks to your help
I successfully generate lm.binary and trie
but deepspeech crash on it without say any error
dara1400
(Dara1400)
June 13, 2019, 9:20am
12
thanks to your help
I successfully generate lm.binary and trie
but deepspeech crash on it without say any error
this is my text file:
one
two
three
how
what
could
where
report
maximize
minimize
could you maximize the form
would you minimize the form
I also had to add --discount_fallback to lmplz
and i am using deepspeech 0.5.0
what is my mistake?
@kdavis @yv001 @nmstoker
yv001
(Yv)
June 13, 2019, 10:01am
13
I haven’t migrated to 0.5.0 from 0.4.1 yet, so that might be specific to the new release. Custom lm models work just fine for me on 0.4.1.
1 Like
dara1400
(Dara1400)
June 13, 2019, 10:21am
14
OK
I will test it on 0.4.1,
Did you use --discount_fallback to lmplz
Is the version of kenlm important?
@yv001
yv001
(Yv)
June 13, 2019, 10:25am
15
no, i did not use the option
1 Like
dara1400
(Dara1400)
June 13, 2019, 10:40am
16
I think I need an example of a text file winch is fine for kenlm,
I didn’t find any on the web,
would you send me an example?
@yv001
yv001
(Yv)
June 13, 2019, 10:49am
17
one is referred in the original link above and used in the script, you can also read about the lm in this post .
Simple utf8 encoded file with lowercase a-z and apostrophe phrases one per line should work. Also make sure that you have compatible line endings (\n on linux).
1 Like
dara1400
(Dara1400)
June 13, 2019, 10:55am
18
I have to train model after creating new LM and Trie?
my sentences are English
@yv001
yv001
(Yv)
June 13, 2019, 11:11am
19
For standard English, the acoustic model can stay the same. Fine tuning of the acoustic model would be needed if you were planning on transcribing atypical words, e.g. names of products or companies etc.
1 Like
dara1400
(Dara1400)
June 13, 2019, 12:10pm
20
thank a lot
So I dont have to train the model because i dont need any new words.
I could not find solution for my problem of making LM and Trie
this is my text file:
one
two
three
how
what
could
where
report
maximize
minimize
could you maximize the form
would you minimize the form
I know this will so much asking, Would you test it? (making lm and trie and run deepspeech)
@yv001
yv001
(Yv)
June 13, 2019, 1:14pm
21
sorry, don’t have time to setup 0.5.0 now
1 Like