I have recording in mp3 format which i converted into wav using ffmpeg and it sounds absolutely clear.
!ffmpeg -i result1.wav -ar 16000 output1.wav
On using the deepspeech model- i am getting absurd response. following is my code-
“”"
from deepspeech import Model
import wave
import numpy as np
model = Model(’./deepspeech-0.9.3-models.pbmm’)
model.enableExternalScorer(’./deepspeech-0.9.3-models.scorer’)
fin = wave.open(‘result1.wav’, ‘rb’)
audio = np.frombuffer(fin.readframes(fin.getnframes()), np.int16)
fin.close()
Perform inference
infered_text = model.stt(audio)
infered_text
“”"
Please suggest if i have missed anything or let me know for further information.