Scorer hyperparameters

yk98 · May 12, 2020, 4:46pm

Hi,

I want to use the pre trained deepspeech 0.7.0 model as is. To use it for my use case, I saw that making an external scorer helps.

There are many hyperparameters that impact the scorer like “lm alpha”, “lm beta” along with a bunch of other parameters like: “beam width”

How will changing these parameters affect the transcription output?

Also is the scorer required to train the acoustic model??

Thanks!

reuben · May 12, 2020, 6:03pm

This should answer the hyperparameter questions: https://distill.pub/2017/ctc/

No, it’s only used for the test set at the end of training, which you can skip by simply not providing a --test_files parameter.

dabinat · May 12, 2020, 8:10pm

There’s a script in the repo, lm_optimizer.py, that can help you figure out the optimal alpha and beta parameters for your custom language model.

yk98 · May 13, 2020, 6:51am

are lm alpha and lm beta used for acoustic model training?