Use scorer deep speech in windows 10

reza · November 5, 2020, 8:57pm

Hello . I downloaded the following file from https://github.com/mozilla/DeepSpeech-examples/tree/r0.8/net_framework GateHub and ran it in Visual Studio software on Windows, but when I say a word through a microphone, for example:
HI
you are
two
on
Output text :
I
you are
to
o
Going back to this, I want to know how I can improve its accuracy for testing. I came across scorer in the documents, but I do not know how to use this in Windows. Thank you for your help.

https://deepspeech.readthedocs.io/en/v0.8.0/Scorer.html

othiele · November 6, 2020, 8:15am

Please read the docs and search before you post, for .NET you can use:

sttClient.EnableExternalScorer(scorer ?? “kenlm.scorer”);

reza · November 6, 2020, 8:34am

Hello . I said words like close , you are , Hi . through the microphone, but the output showed me these. What can I do to improve the DeepSpeak model through the microphone? I just want to use deep speech , which means I do not want to teach datasets.

lissyx · November 6, 2020, 8:40am

You don’t provide any context, and poor detection can be related to a lot of factors: speed of speech, accent, etc.

Not to mention you are using example code, which is provided as-is.

reza · November 6, 2020, 8:46am

I said a word like hello, close, etc. through the microphone in English with a delay of about 5 seconds, but I did not get a good output. My Native language is Persian

Another test I said in a sentence like How are you without pausing but still did not get a good result.

lissyx · November 6, 2020, 8:48am

Sorry, but I have already explained to you:

example code, we can’t guarantee there is no bug
model is mostly trained on american english
you are speaking with persian accent

reza · November 6, 2020, 8:51am

thank you . Now, if I test words with an American English accent through the Logman Dictionary, for example, will I get a good result?

And can I teach Mozilla on a 50 GB English database to improve Deep speech mockery? Do I have to do fine tune or transfer learning ?

lissyx · November 6, 2020, 8:54am

As said previously, we can’t guarantee, I don’t know what the logman dictionary does here.

Again, it depends on what data that is … If you have 50GB of English in Persian accent, you can fine-tune the released english checkpoints. Please check the docs.

reza · November 6, 2020, 8:58am

thank you . By 50GB I mean the following database

othiele · November 6, 2020, 9:01am

You are mixing a lot of different subjects here.

Test some audio that you record with a mic directly with command line DeepSpeech to see how it does with your accent.
Try the 0.9.1 model.
Understand how it works then use it in .NET.
Common voice or a dictionary won’t do much good if you are trying to recognize a Persian accent. You will need hundreds of hours of Persian accent to train.

lissyx · November 6, 2020, 9:01am

We are documenting that Common Voice English is used for training: Release DeepSpeech 0.8.2 · mozilla/DeepSpeech · GitHub

Topic		Replies	Views
EnableExternalScorer failed with 'Invalid scorer file.' (0x2002) DeepSpeech	3	1273	March 25, 2021
Installing Deep Speech for the first time: thinking out loud DeepSpeech	10	1578	March 13, 2020
Mprove speech to text deep speech DeepSpeech issue	3	637	November 4, 2020
DeepSpeech for narrow-domain bot creation DeepSpeech	26	1114	February 11, 2021
Explanation of how the Scorer works after predicted transcripts? DeepSpeech	4	865	June 4, 2020

Use scorer deep speech in windows 10

Related topics