Per word confidence

CoreyG · April 5, 2019, 2:30pm

Hey guys! Does anyone know how to get the confidence/probability per word when you run inference on a sentence in DeepSpeech? For example, with the sentence:
“Hey how are you?”

Hey = 90% confident
how = 65% confident
…etc

dabinat · April 6, 2019, 2:09am

This should be coming soon: https://github.com/mozilla/DeepSpeech/pull/2012

However, note that it is per-letter probability, not per word, and I’m not sure exactly how it will be exposed to clients from the API.

reuben · April 6, 2019, 3:46am

That PR exposes per-transcription probability, not per letter or per word. Doing either of those requires extending the decoder to keep track of the character/word level info.

CoreyG · April 8, 2019, 5:14pm

Alright cheers thanks guys!

kdavis · April 8, 2019, 5:28pm

Per word timings would require we embed language info into the engine.

For example, in English you can, more or less, split on spaces to get words. However, for Mandarin written in Simplified Chinese, each character is a word, so code that splits on spaces would not split on words for Mandarin. If we split on words, there would have to be code in the engine that works differently for different languages.

Embedding language specific info into the engine is not something we want to do. We want the engine to remain as language independent as possible.
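
To illustrate the point about spaces, here is a minimal Python sketch. It is not DeepSpeech code; `split_on_spaces` is a hypothetical helper, and treating each Mandarin character as its own word is the simplification used in this post, not a real segmenter.

```python
def split_on_spaces(text):
    """Naive word segmentation: split the transcript on single spaces."""
    return text.split(" ")


english = "hey how are you"
mandarin = "你好吗"  # written without spaces between words

# English comes apart into words as expected.
print(split_on_spaces(english))   # ['hey', 'how', 'are', 'you']

# The Mandarin transcript has no spaces, so it stays one "word".
print(split_on_spaces(mandarin))  # ['你好吗']

# Under the "each character is a word" simplification, a different,
# language-specific rule is needed: split per character.
print(list(mandarin))             # ['你', '好', '吗']
```

The same function gives a useful answer for one language and a useless one for the other, which is why word-level logic would pull language knowledge into the engine.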

reuben · April 8, 2019, 5:33pm

Kelly, note this is about probabilities (approximate confidence values), not timings. We can compute per character probability (per timestep token in a generic way) but we don’t currently do it and it would be a significant overhead in the state size for the decoder.

reuben · April 8, 2019, 5:34pm

Sorry, per word probabilities, not character. Currently we only store per-beam (per transcription candidate) probabilities.
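
For anyone wondering what per-word values could look like if the decoder did track per-character probabilities, here is a rough sketch. Everything in it is an assumption: DeepSpeech does not expose this data as of this thread, the `(character, probability)` input format is made up, words are split on spaces (so it only suits languages like English), and the geometric mean is just one possible way to combine character probabilities.

```python
import math


def word_confidences(tokens):
    """Group hypothetical (character, probability) pairs into words.

    Each word's score is the geometric mean of its character
    probabilities. Probabilities must be greater than zero.
    """
    words = []
    chars, log_probs = [], []

    def flush():
        # Close the current word, if any, and record its score.
        if chars:
            score = math.exp(sum(log_probs) / len(log_probs))
            words.append(("".join(chars), score))
            chars.clear()
            log_probs.clear()

    for ch, p in tokens:
        if ch == " ":
            flush()  # a space ends the current word
        else:
            chars.append(ch)
            log_probs.append(math.log(p))
    flush()  # last word has no trailing space
    return words


# Made-up probabilities, for illustration only.
tokens = [("h", 0.9), ("i", 0.9), (" ", 1.0), ("y", 0.5), ("o", 0.5)]
for word, score in word_confidences(tokens):
    print(f"{word} = {score:.0%} confident")
```

The scores are only approximate confidence values, in the same sense as the per-candidate probabilities mentioned above.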

CoreyG · April 8, 2019, 5:36pm

Alright thank you. I really appreciate the replies

kdavis · April 9, 2019, 4:27am

Sorry, don’t know why I ended up saying “timings” and not “probabilities”.

But you seem to suggest, assuming you’re trying to do this in a language-independent manner, that it is possible to segment the output text into words. How would that work in a language-independent manner?

reuben · April 9, 2019, 1:23pm

I’m not suggesting that, just saying anything finer grained than per-candidate sentence probability requires extending the decoder.
