Working with deepspeech I noticed that the overall recognition rate is not good.
This is not in accordance with what is claimed in the paper.
I am using cpu architecture and trying to transcribe my audio files, but the error rate is very high.
I am using Mono Channel, 16kh, 16 bit audio files.
I would really appreciate guidance over this.