I have an Indian accent and trying DeepSpeech pre-trained English model, V0.91. When I give my voice as “Hello Im testing DeepSpeech thank you very much”, to DeepSpeech, Im getting the output as “hoooooo”. Kindly suggest how to recognize my voice. Attaching the snippet of code i used. Thanks for your help and suggestion.
import numpy as np
model_file_path = r"C:\deepspeech\0.91\deepspeech-0.9.1-models.pbmm"
model = deepspeech.Model(model_file_path)
scorer_file_path = r"C:\deepspeech\0.91\deepspeech-0.9.1-models.scorer"
recordingfile = r"C:\testingdeepspeech.wav"
w = wave.open(recordingfile, ‘r’)
rate = w.getframerate()
frames = w.getnframes()
buffer = w.readframes(frames)
data16 = np.frombuffer(buffer, dtype=np.int16)
text = model.stt(data16)