Inaccurate results from 0.9.3 model

Srishti_Nagu · April 16, 2024, 7:30am

I have recording in mp3 format which i converted into wav using ffmpeg and it sounds absolutely clear.

!ffmpeg -i result1.wav -ar 16000 output1.wav

On using the deepspeech model- i am getting absurd response. following is my code-

“”"
from deepspeech import Model
import wave
import numpy as np
model = Model(’./deepspeech-0.9.3-models.pbmm’)
model.enableExternalScorer(’./deepspeech-0.9.3-models.scorer’)
fin = wave.open(‘result1.wav’, ‘rb’)
audio = np.frombuffer(fin.readframes(fin.getnframes()), np.int16)
fin.close()

Perform inference

infered_text = model.stt(audio)
infered_text
“”"

Please suggest if i have missed anything or let me know for further information.

jesslynnrose · April 16, 2024, 7:47am

Hello! Thanks so much for getting in touch and I’m so sorry that I’m only going to be able to offer very limited support. The Common Voice is a Mozilla Foundation project that is an opensourced, multilingual speech corpus and the platform and tools dedicated to supporting and collecting this data. I’m so sorry that Deepspeech is a separate project held by the Mozilla Corporation and I’m not clear on how actively it is being maintained and supported.

My expertise and the scope of this forum is limited to the Common Voice datasets, perhaps the Deepspeech Github might be a better source of information?

Topic		Replies	Views
"Transcription wrong on giving audio from Mozilla Common voice dataset DeepSpeech	17	692	April 9, 2020
Fine-tuning DeepSpeech Model (CommonVoice-DATA) DeepSpeech	60	6175	August 20, 2019
Empty results in inference mode DeepSpeech learning	3	603	February 6, 2020
Using common voice datasets? DeepSpeech	5	1072	November 17, 2020
DeepSpeech giving bad results DeepSpeech learning	5	2312	February 11, 2020

Inaccurate results from 0.9.3 model

Perform inference

Related topics