I found few files related to version 0.4.0 but I couldn’t find a big tar file(~in GBs) which contains the model file/binaries which we have in version 0.3.0.
Can anyone help me get the latest pretrained binaries of the Deepspeech?
Thanks
There isn’t any --model parameter in DeepSpeech.py, so I don’t know what you’re talking about. It looks like you passed – epoch -3 instead of --epoch -3 (note the dashes), probably some text editor screwing with you, so it defaulted to 75 epochs.
It was – epoch -3 only, pasted in the wrong format. But I have a question, if there is a gap between – and epoch , is that a problem? ex – epoch or should there be no space like
–epoch ?
For me, in order to call .pb, --initialize_from_frozen_model was used and for calling the .pbmm --model was used. A lot of users are also using the same command to call it.
As stated in read me: deepspeech --model models/output_graph.pbmm --alphabet models/alphabet.txt --lm models/lm.binary --trie models/trie --audio my_audio_file.wav
Yes, that is a problem, there can’t be a space there. The deepspeech binary, used for inference, is different from DeepSpeech.py, used for training. The latter has no --model parameter.
I am following this only. As there is no mention of used of frozen model, should I use it or no?
–initialize_from_frozen_model models/output_graph.pbmm
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
12
This code has been removed from master. There is no --initialize_from_frozen_model anymore
Thanks Carlos and Reuben for the response.
Can you guy suggest some important attributes like bit rate and accent which should be taken care of to get the best out of DeepSpeech model.
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
16
Check the documentation, it’s all covered: PCM 16 bits, 16kHz mono
@lissyx , @carlfm01 & @reuben
My input audio has 44.1kHz sampling rate and I tried to downsample(16k) it through Audacity as well as sox. But I am getting very bad result after downsampling. Although I am getting decent result with original sampling rate(44.1kHz) but to get better result i tried to make it compatible with model but didn’t get what i expected.
Any clue or reason behind this??
thank you so much guys for all the previous responses
Can you share an example of the audio that you are using? If you can, share both versions of the audio.
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
19
That’s unclear. The result you shared above are with which audio files ? Can you ensure it’s mono as well ? Pushing stereo at 16kHz would kind of explain that.
You want to retrain from scratch a new model with 44.1kHz ? That’s going to require a lot of data and processing power.