Quality of the CTC decoder / limit inference words to known words only

elpimous_robot · December 1, 2017, 8:22pm

Hello all,

I’m using a homemade model, for french language, files are ok.

My mic array records in continuous, and inference is done when a vader function cut.

When I talk, without noise, inference is very good, but…
with TV or other noise, inference produce anything like this :

le a eaefanke eethe
It doesn’t correspond to any word in my vocabulary.txt

my question :

How could I restrict inference to known words ? (and forget others)

Thanks all.

francob · December 1, 2017, 8:34pm

Hi, if you check out the hyperparameters and increase LM_WEIGHT and VALID_WORD_COUNT_WEIGHT then you should get better results.

elpimous_robot · December 1, 2017, 8:55pm

Hi Francob.
Thanks, I’ll try it…
PS : what hyperparams do you use ?

francob · December 1, 2017, 9:24pm

5 for LM_WEIGHT and 3 for VALID_WORD_COUNT_WEIGHT, haven’t optimized them yet though

dbanka · December 27, 2017, 7:41am

Hey Francob, How can we optimize LM_WEIGHT , WORD_COUNT_WEIGHT and VALID_WORD_COUNT_WEIGHT

elpimous_robot · December 28, 2017, 7:17pm

Hi. They are params !
set it like this :

-- WORD_COUNT_WEIGHT = 5 \
...