How can silence be better handled?

I have trained a custom model and it's giving acceptable results so far.

The problem is with silent moments, especially at the beginning. When I run inference with --json, the first word always starts at time=0, and it is usually detected wrongly. Subsequent words are usually correct.

The question is: how can silence be "skipped", so the first word doesn't always start at zero and detection improves?

This seems like the best place to ask about fixing it.

We've received reports of behavior like that on GitHub, but could not really reproduce it.
Some people adjusted the library to add a few milliseconds of padding (50 ms, I think) and it helped a lot. Given that we could not reproduce the original issue, it's hard to act on it for now.
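For anyone who wants to try the workaround without patching the library, the same effect can be approximated by prepending a short stretch of silence to the audio before inference. A minimal sketch with NumPy (the 16 kHz sample rate, the function name, and doing this outside the library are all assumptions, not part of the actual codebase):

```python
import numpy as np

SAMPLE_RATE = 16000  # assumed model input sample rate
PAD_MS = 50          # padding duration mentioned in the thread

def pad_with_silence(waveform: np.ndarray, pad_ms: int = PAD_MS,
                     sample_rate: int = SAMPLE_RATE) -> np.ndarray:
    """Prepend pad_ms milliseconds of silence (zeros) to a mono waveform."""
    pad_samples = int(sample_rate * pad_ms / 1000)
    silence = np.zeros(pad_samples, dtype=waveform.dtype)
    return np.concatenate([silence, waveform])

# Example: one second of placeholder audio, padded with 50 ms of silence.
audio = np.random.randn(SAMPLE_RATE).astype(np.float32)
padded = pad_with_silence(audio)
```

Note that word timestamps in the output will then be shifted by the padding duration, so subtract it back if exact timings matter.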

Thanks for that tip … will try it out.