Language Model For Deepspeech

spygaurad · September 24, 2019, 10:00am

Hello,
I used a Kenlm built language model of vocabulary.txt ( text of transcriptions ) of Nepali language to build a DeepSpeech model and i built another model without using any language model.
The latter seems to be not working at all.

It means DeepSpeech use language model while training as well?
If so can I also use another language model for inferencing besides the one used for traning?

reuben · September 24, 2019, 10:02am

The language model is not used during training.

spygaurad · September 24, 2019, 10:19am

Why does the two models trained with and without language model infer differently while testing?

reuben · September 24, 2019, 10:16am

I said it’s not used during training, but it is used for inference (that’s what it’s for), which includes the final test epoch in the end. You can re-use the same checkpoint/model with and without the language model, or with different language models. The training phase itself is not dependent on the LM.

spygaurad · September 24, 2019, 10:19am

Trained with Nepali language model ( path to language model given while training )
Trained without Language model ( path not given… It might have used default lm.binary)

I used language model for inference for both above test.

reuben · September 24, 2019, 10:24am

The language model is not the cause of the discrepancy, something else is different in your training procedure. Like I said, the LM is not used during training.

spygaurad · September 24, 2019, 10:24am

Okay. Thanks for the information.

alchemi5t · September 24, 2019, 1:56pm

Set alpha and beta=0, so that any loaded LM won’t affect your results. You cannot be having the default lm loaded for nepali. @spygaurad

spygaurad · September 26, 2019, 2:46am

Do i need to retrain setting alpha and beta parameters to 0 @alchemi5t
or should i be using them during testing/inferencing ?

I can simply not provide language model in inferencing if i dont want my LM to affect the result.

alchemi5t · September 26, 2019, 2:54am

You don’t need to retrain it. Just set them to 0 while inferencing.

spygaurad · September 26, 2019, 7:25am

Thnks for the information @alchemi5t