- Generate the LM from your text corpus.
- Generate the scorer package with the LM above and any default alpha and beta values.
- Run lm_optimizer.py with your own data; fine-tuning on the LibriSpeech test set does nothing to help your use case.
- At the end, regenerate the package with the new fine-tuned alpha and beta values.
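
The last three steps above could be sketched roughly as follows. This is only a sketch that builds and prints the command lines without running them. It assumes the first step already produced an LM binary and a vocabulary file. All file names, the checkpoint directory, and the alpha and beta numbers are hypothetical placeholders, and the flag names are recalled from the DeepSpeech 0.9 documentation, so check them against each tool's `--help` output for your version.

```python
import shlex

def scorer_package_cmd(alpha, beta, lm="lm.binary", vocab="vocab.txt",
                       alphabet="alphabet.txt", package="kenlm.scorer"):
    """Build (but do not run) a generate_scorer_package command line.

    File names are hypothetical; flag names are assumed from the 0.9 docs.
    """
    return [
        "./generate_scorer_package",
        "--alphabet", alphabet,
        "--lm", lm,
        "--vocab", vocab,
        "--package", package,
        "--default_alpha", str(alpha),
        "--default_beta", str(beta),
    ]

def lm_optimizer_cmd(test_csv, checkpoint_dir, scorer="kenlm.scorer", n_trials=100):
    """Build (but do not run) an lm_optimizer.py command line.

    test_csv should point at YOUR data, not the LibriSpeech test set.
    """
    return [
        "python3", "lm_optimizer.py",
        "--test_files", test_csv,
        "--checkpoint_dir", checkpoint_dir,
        "--scorer_path", scorer,
        "--n_trials", str(n_trials),
    ]

# Arbitrary starting values; any defaults will do for the first package.
initial = scorer_package_cmd(alpha=0.9, beta=1.2)

# Tune on your own data (hypothetical paths).
tune = lm_optimizer_cmd("my_dev.csv", "checkpoints/")

# Placeholder numbers: substitute whatever lm_optimizer.py reports as best.
final = scorer_package_cmd(alpha=0.6, beta=1.5)

for cmd in (initial, tune, final):
    print(shlex.join(cmd))
```

Keeping the package command in one function makes it harder to regenerate the final scorer with different LM or vocabulary files than the ones used during tuning.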