I have read the docs of version 0.7.4, but i still can’t get a full vision in chinese Mandarin speech recognition.
Usually, in other aticles, they offen don’t use unicode codepoint and do two things more.
The first is dividing sentence into phrases in language model,
and the second is using chinese pinyin(chinese phoneme) as a media between sound and character.
So my question is how can i process Chinese Mandarin speech recognition using our model more effectively.
Great to have you here. Why don’t you start by setting up an English training so you get to know DeepSpeech as a software. Then you can switch over to Mandarin.
@reuben has been training a Mandarin model for some time, don’t know whether there is a repo or other resources you could use?
@othiele For those people who knows how DeepSpeech works, do you know where to find trained Chinese voice models and scorers?
Appreciate your comments!
To be honest, I was trying it last night. But the repo didn’t come with a trained model. It comes with a Chinese voice dataset which is more than 150GB. I don’t think my homemade workstation can load that work… But it works anyway. The only thing is that no public trained model for now… I’m looking for…