I was trying to run inference with the pre-trained LibriSpeech model on an audio sample I picked at random from the web. The result is quite discouraging: the model predicted every single character wrong.
Ground truth: “The Story of Arthur the Rat. Once upon a time there was a rat who couldn’t make up his”
Predicted: “HAM AUUEWIR CCHIUVHE C O HO AA UBBUSH”
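
To put a number on the mismatch, here is a minimal sketch that computes the character error rate (CER) between the two strings using a plain Levenshtein distance. The normalization step (uppercasing and keeping only letters, apostrophes and spaces) is an assumption about the model's output alphabet, and the helper names `normalize`, `edit_distance` and `cer` are purely illustrative, not part of any library.

```python
import re


def normalize(text):
    """Uppercase and keep only A-Z, apostrophes and single spaces.

    Assumption: the model emits uppercase letters, apostrophes and spaces.
    """
    text = text.upper().replace("’", "'")
    text = re.sub(r"[^A-Z' ]", " ", text)
    return " ".join(text.split())


def edit_distance(ref, hyp):
    """Levenshtein distance between two strings (row-by-row dynamic programming)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(reference, hypothesis):
    """Character error rate: edit distance divided by reference length."""
    ref = normalize(reference)
    hyp = normalize(hypothesis)
    if not ref:
        raise ValueError("reference is empty after normalization")
    return edit_distance(ref, hyp) / len(ref)


ground_truth = ("The Story of Arthur the Rat. Once upon a time there was "
                "a rat who couldn’t make up his")
predicted = "HAM AUUEWIR CCHIUVHE C O HO AA UBBUSH"
print(f"CER: {cer(ground_truth, predicted):.2f}")
```

A CER near or above 1.0 would mean the output is essentially unrelated to the reference, rather than a transcript with a few mistakes.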
Is there any way to solve this?