Restricted Vocabulary

mischmerz · January 14, 2020, 6:42pm

I’ve been following this project for a while now. Great work. I am still wondering whether there’s a viable way to restrict the vocabulary to the words used in, e.g., a home automation environment, to improve accuracy. I just want DeepSpeech to understand a handful of phrases such as ‘turn the light on’ or ‘turn the living room fan off’. Now that Snips has gone to Sonos, pretty much everybody in the maker environment is looking for alternatives.

lissyx · January 14, 2020, 6:44pm

Just make them into a text file and generate your own language model. It is documented under data/lm.
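
As a minimal sketch of the "make them into a text file" step, the snippet below expands a small command grammar into one sentence per line. The device and state lists, the helper names, and the output file name `vocabulary.txt` are assumptions made for illustration; they are not part of DeepSpeech or its data/lm tooling.

```python
from itertools import product
from pathlib import Path

# Hypothetical command grammar for illustration; replace with your own phrases.
DEVICES = ["light", "living room fan"]
STATES = ["on", "off"]


def build_sentences(devices, states):
    """Return every 'turn the <device> <state>' command as a lowercase string."""
    return [f"turn the {device} {state}" for device, state in product(devices, states)]


def write_corpus(path, sentences):
    """Write one sentence per line, the plain-text layout a language model tool reads."""
    Path(path).write_text("\n".join(sentences) + "\n", encoding="utf-8")


if __name__ == "__main__":
    sentences = build_sentences(DEVICES, STATES)
    write_corpus("vocabulary.txt", sentences)  # hypothetical file name
    print(f"wrote {len(sentences)} sentences")
```

Listing every phrase you expect to hear, rather than only the individual words, lets the language model learn which word sequences are likely.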

mischmerz · January 14, 2020, 7:43pm

Just to be sure: Does this require a new set of wavs, or is the restricted language model extracted from the complete model?

m.

lissyx · January 14, 2020, 7:58pm

You build it yourself. This is only the language model; you can re-use the acoustic model. We have experimented quite a lot, and it gives pretty good results for that kind of use-case. This way you don’t have to redo the long training step, and a small language model can be created quickly and efficiently. Just follow the link, and build kenlm: I have code doing that on device (RPi4), for example.
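
For readers who want to script the KenLM part, here is a rough sketch that only assembles the two usual KenLM commands (`lmplz` to estimate an ARPA model, `build_binary` to convert it). The file names, the n-gram order of 3, and the helper names are assumptions; `--discount_fallback` is included because very small corpora often make discount estimation fail without it. Any further DeepSpeech-specific packaging steps depend on the release you use and are described under data/lm, so they are not shown here.

```python
import subprocess


def kenlm_commands(corpus, arpa, binary, order=3):
    """Build the two KenLM invocations as argument lists (nothing is executed here)."""
    estimate = [
        "lmplz",
        "--order", str(order),   # assumed order; small command sets rarely need more
        "--text", corpus,        # one sentence per line
        "--arpa", arpa,
        "--discount_fallback",   # kept last; helps with very small corpora
    ]
    convert = ["build_binary", "trie", arpa, binary]
    return [estimate, convert]


def run_all(commands):
    """Run each command in turn; assumes the KenLM tools are built and on PATH."""
    for cmd in commands:
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical file names for illustration.
    run_all(kenlm_commands("vocabulary.txt", "lm.arpa", "lm.binary"))
```

Keeping command construction separate from execution makes it easy to print and check the commands before running them on a device such as an RPi4.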

chags · February 3, 2020, 7:32am

@lissyx would you mind sharing the code you mentioned for doing this on device? Thanks.

lissyx · February 3, 2020, 7:50am

There’s nothing to share, I just built KenLM’s tools for ARM and ran them on-device …

beiserjohannes · February 3, 2020, 7:29pm

Have a read here
