We are using DeepSpeech to transcribe audio files that mostly contain medical vocabulary. We tried adding a couple of words that were not available in the pre-built model, and then created our own scorer file following the documentation: External scorer scripts — Mozilla DeepSpeech 0.9.3 documentation
DeepSpeech model : 0.9.3
native_client : native_client.amd64.cpu.linux.tar.xz
Linux used to build the KenLM binaries and generate the scorer : Ubuntu 20.04
Python version : 3.8.5
Do you do everything on the same machine? The screenshots look like Windows to me. Test everything with a really small scorer on one machine, and check how this message is thrown in the native client.
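Before building even a really small scorer, it can help to sanity-check the corpus text that goes into it. The following is only a sketch, not part of the DeepSpeech tooling: `check_corpus` is a hypothetical helper, the alphabet string is assumed to match the English alphabet.txt (letters and apostrophe, with space as the word separator), and the sample sentences and word list are made up. It reports which of the newly added words never occur in the corpus and which corpus words use characters outside the alphabet.

```python
from collections import Counter


def check_corpus(lines, alphabet, required_words):
    """Return (missing_words, out_of_alphabet_words) for a scorer corpus.

    lines          -- iterable of sentences, one per entry
    alphabet       -- string of characters the acoustic model can emit
                      (assumption: same characters as your alphabet.txt)
    required_words -- the new words you expect the scorer to know
    """
    allowed = set(alphabet)
    counts = Counter()
    bad = set()
    for line in lines:
        # Lowercasing here is an assumption about how the corpus is normalized.
        for word in line.strip().lower().split():
            counts[word] += 1
            if not set(word) <= allowed:
                bad.add(word)
    missing = [w for w in required_words if counts[w.lower()] == 0]
    return missing, sorted(bad)


# Hypothetical data, for illustration only.
ALPHABET = "abcdefghijklmnopqrstuvwxyz'"
corpus = [
    "the patient has tachycardia",
    "start amoxicillin 500mg today",
]
missing, bad = check_corpus(
    corpus, ALPHABET, ["tachycardia", "amoxicillin", "bradycardia"]
)
print("missing from corpus:", missing)
print("outside alphabet:", bad)
```

With the sample data, "bradycardia" is reported as missing and "500mg" as containing characters outside the assumed alphabet. If both lists come back empty for the real corpus, the text itself is probably not the cause of the problem.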
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
No external resources, please; share everything here as text only. It’s also super messy: you share info about Ubuntu 20.04, but it’s all screenshots from Windows and obviously not over PuTTY.
We don’t support training on Windows, and we have no experience with WSL.
You also don’t share your code to repro: please verify your scorer works with our binaries first. And why do we see ds.disableExternalScorer() followed by ds.enableExternalScorer()?
KenLM is a git submodule; please use the same sha1 instead of downloading a random version.
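For reference, a text-only repro of the scorer check could look roughly like the sketch below. It assumes the `deepspeech` 0.9.3 Python package and `numpy` are installed; the function names `compare_with_and_without_scorer` and `run` and all file paths are hypothetical. It decodes the same audio once without a scorer and once with the custom one. It also assumes that a freshly created `Model` has no scorer loaded, so no `disableExternalScorer()` call is made before `enableExternalScorer()`.

```python
def compare_with_and_without_scorer(model, audio, scorer_path):
    """Decode `audio` without a scorer, then with the custom scorer.

    `model` is expected to behave like deepspeech.Model: it must provide
    stt(audio) and enableExternalScorer(path).
    """
    plain = model.stt(audio)                 # acoustic model only
    model.enableExternalScorer(scorer_path)  # load the custom scorer
    scored = model.stt(audio)                # same audio, now with the scorer
    return plain, scored


def run(model_path, scorer_path, wav_path):
    """Load the real model and a 16-bit mono WAV file, then compare outputs."""
    import wave

    import numpy as np
    from deepspeech import Model

    model = Model(model_path)
    with wave.open(wav_path, "rb") as wav:
        if wav.getframerate() != model.sampleRate():
            raise ValueError("WAV sample rate does not match the model")
        frames = wav.readframes(wav.getnframes())
    audio = np.frombuffer(frames, dtype=np.int16)
    return compare_with_and_without_scorer(model, audio, scorer_path)
```

Calling `run("deepspeech-0.9.3-models.pbmm", "custom.scorer", "sample.wav")` (placeholder file names) on the same Ubuntu machine that built the scorer, and pasting the two transcripts plus any error text, would give a repro that does not depend on screenshots.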