How can i process Chinese Mandarin speech recognition

KimYip · June 30, 2020, 6:15am

I have read the docs of version 0.7.4, but i still can’t get a full vision in chinese Mandarin speech recognition.
Usually, in other aticles, they offen don’t use unicode codepoint and do two things more.
The first is dividing sentence into phrases in language model,
and the second is using chinese pinyin(chinese phoneme) as a media between sound and character.

So my question is how can i process Chinese Mandarin speech recognition using our model more effectively.

othiele · June 30, 2020, 8:05am

Great to have you here. Why don’t you start by setting up an English training so you get to know DeepSpeech as a software. Then you can switch over to Mandarin.

@reuben has been training a Mandarin model for some time, don’t know whether there is a repo or other resources you could use?

Alexander_Liu · August 20, 2020, 12:40am

@othiele For those people who knows how DeepSpeech works, do you know where to find trained Chinese voice models and scorers?
Appreciate your comments!

othiele · August 20, 2020, 7:29am

Chinese seems to be hard to find for Mozilla DeepSpeech and there were some questions about it. Maybe search the forum and ask others.

But as we are an open community, there seems to be one in a similar, but different, repo. If it works, please let us know.

Alexander_Liu · August 20, 2020, 7:05pm

To be honest, I was trying it last night. But the repo didn’t come with a trained model. It comes with a Chinese voice dataset which is more than 150GB. I don’t think my homemade workstation can load that work… But it works anyway. The only thing is that no public trained model for now… I’m looking for…

othiele · August 20, 2020, 7:17pm

I meant this model section, they are about 800 MB.

Alexander_Liu · August 20, 2020, 9:39pm

Thank you. I downloaded it.
the compressed file is like this:

Do you know how to use it? GitHub - PaddlePaddle/PaddleSpeech: Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
File params.pdparams is 800MB.
I didn’t find how to use the deepspeech engine with the trained model… I have downloaded the docker image from PaddlePaddle…
Is it like this? deepspeech --model params.pdparams --scorer some.scorer --audio myaudio.wav?

othiele · August 21, 2020, 7:12am

Sorry, we don’t support the competition

This project has nothing to do with the other, therefore it is hard to say much. But they have a file called infer.py

Topic		Replies	Views
What is the right format for building a language model for chinese? DeepSpeech	0	302	January 25, 2022
Training Traditional Chinese for Common Voice using Deep Speech DeepSpeech	18	2674	November 19, 2020
Link to mandarin chinese text corpus DeepSpeech	10	745	January 25, 2022
Training Chinese model DeepSpeech	22	9052	April 22, 2021
Any pretrained Chinese model can be shared? DeepSpeech	0	327	July 29, 2020

How can i process Chinese Mandarin speech recognition

Related topics