A debugging script that shows most likely characters and their probabilities can be found here:
https://github.com/pvanickova/DeepSpeech/blob/master/bin/show_inferred_characters.py
It loads a frozen graph, runs inference on given wav, model and alphabet, softmaxes the logits to get the probabilities and displays most likely characters and their probabilities, “-” is used for blank. Each character prediction is on a new line so the predicted text can be read in columns.
E.g. “cat” string could have these character predictions:
c k - (0.999957) (1.62438e-05) (6.99057e-06)
a - (0.999998) (1.05044e-06) (4.43978e-07)
t d - (0.999999) (4.27885e-07) (1.09088e-07)
This is how the script is run:
python3 ./bin/show_inferred_characters.py --input-file "../data/my.wav" --model-file ../data/models/output_graph.pb --alphabet-file ../data/models/alphabet.txt --predicted-character-count 3
If it looks useful enough for others, give me a shout and I’ll create a pull request.