"Hello this is a test" returns "hallo this is a tast" (other non-words also returned)

Paul_Raine · May 10, 2020, 2:48am

It seems that my installation of deepspeech (0.6.1) is returning non-words:

“Hello this is a test” returns “hallo this is a tast”
“i like apples and oranges” returns “i like auples and aranges”

Am I doing something wrong? Can I configure it to return only actual words?

othiele · May 10, 2020, 12:27pm

The algorithm searches in the language model (trie + lm.binary) for words. If “tast” is in there, it can be an output. So you could reduce the word combinations in the language model.

Paul_Raine · May 12, 2020, 10:48am

So I have downloaded generate_lm.py and want to use it to generate my own language model, but I’m unsure how to do it… I just need to feed it a text file containing my sentences?

othiele · May 12, 2020, 11:45am

Sorry, you’ll have to read the code and/or read documentation. Building your own language model makes up roughly 20% of questions here, so be prepared to put in a couple of hours.

https://deepspeech.readthedocs.io/en/v0.7.0/TRAINING.html

Topic		Replies	Views
How language model is used in deepspeech DeepSpeech	5	8326	February 26, 2018
Issue with Language Model DeepSpeech	11	1057	January 3, 2019
Fine tune the Language Model DeepSpeech	3	503	December 6, 2019
Always same result DeepSpeech	1	381	February 24, 2020
How to test deepspeech on my computer? DeepSpeech	6	7973	March 8, 2018

"Hello this is a test" returns "hallo this is a tast" (other non-words also returned)

Related topics