Can't recognize my words

hebo · April 26, 2021, 12:26pm

input wav file:
s1.wav.zip

model_file = 'deepspeech-0.9.3-models.pbmm'
scorer_file = 'deepspeech-0.9.3-models.scorer'

output:

{
  "transcripts": [
    {
      "confidence": -17.701541900634766,
      "words": [
        {
          "word": "i",
          "start_time": 3.76,
          "duration": 0.0
        }
      ]
    }
  ]
}

Is something wrong with what I'm doing?

ftyers · April 27, 2021, 6:04pm

The audio file sounds Chinese. The model file and scorer and output look English.

hebo · April 29, 2021, 6:00am

“I have a book”

Maybe I’m not pronouncing it clearly.

ftyers · April 29, 2021, 2:10pm

Oh, sorry, I think I must have seen the Chinese title and got confused. Yes, I hear it now, but I also see that the file is stereo. Try giving it mono input instead.
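
For anyone who wants to confirm the channel count before running inference, here is a minimal sketch using Python's standard `wave` module. The helper names `describe_wav` and `is_mono_16k_16bit` are hypothetical, and the 16 kHz, 16-bit, mono target is an assumption based on my understanding of what the released English 0.9.3 model expects, so check it against your own model.

```python
import wave


def describe_wav(path):
    """Return (channels, sample_rate_hz, sample_width_bytes) of a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        return wav.getnchannels(), wav.getframerate(), wav.getsampwidth()


def is_mono_16k_16bit(path):
    """True if the file is mono, 16 kHz, 16-bit PCM (assumed model input format)."""
    return describe_wav(path) == (1, 16000, 2)
```

Calling `describe_wav("s1.wav")` on a stereo file should report 2 as the first value, which would be the signal to downmix before transcribing.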

hebo · April 30, 2021, 4:57am

Thanks for your reply.
I ran "ffmpeg -i s1.wav -ac 1 s2.wav" to get a mono file.
It works!
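
If ffmpeg is not available, the same downmix can be approximated in pure Python. This is only a sketch: `downmix_to_mono` is a hypothetical helper, it assumes 16-bit stereo PCM input, it simply averages the left and right samples (ffmpeg's exact mixing may differ), and it does not resample.

```python
import array
import sys
import wave


def downmix_to_mono(src_path, dst_path):
    """Average the two channels of a 16-bit stereo WAV and write a mono WAV."""
    with wave.open(src_path, "rb") as src:
        if src.getnchannels() != 2 or src.getsampwidth() != 2:
            raise ValueError("expected 16-bit stereo PCM input")
        rate = src.getframerate()
        raw = src.readframes(src.getnframes())

    # Samples are interleaved as L, R, L, R, ... in little-endian order.
    samples = array.array("h")
    samples.frombytes(raw)
    if sys.byteorder == "big":
        samples.byteswap()

    # The average of two int16 values always fits in int16.
    mono = array.array(
        "h", ((left + right) // 2 for left, right in zip(samples[0::2], samples[1::2]))
    )
    if sys.byteorder == "big":
        mono.byteswap()

    with wave.open(dst_path, "wb") as dst:
        dst.setnchannels(1)
        dst.setsampwidth(2)
        dst.setframerate(rate)  # sample rate is left unchanged
        dst.writeframes(mono.tobytes())
```

For example, `downmix_to_mono("s1.wav", "s2.wav")` would write a mono copy at the original sample rate. If the source is not already at the rate the model expects, a resampling step is still needed.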
