I trained the model on a collected domain-specific dataset with a custom scorer. The final WER was really good (taken from evaluate.py). But when I used the exported model (via the deepspeech library) on one of the files from the same test dataset, the result was totally different. (Note: the file was picked from the same dataset that the checkpoints were evaluated on.)
original transcript : “wrong interpretation of the values at intermediate positions so that is the basic problem of”
result from checkpoints : “wrong interpretation of the values at intermediate positions so that is the basic problem of”
If your data is at a different sample rate than 16 kHz, the training code will handle it automatically, but the inference code won’t; you have to specify the --audio_sample_rate flag at export time. I don’t remember if 0.7.4 handles this properly, but that’s the first thing that comes to mind. You should be getting a warning about the sample rate mismatch on the client side, though…
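A quick way to rule this out before running inference is to read the rate straight from the WAV header. This is only a sketch using Python's standard wave module; the function name and the 16000 default are my own assumptions, not anything from the DeepSpeech code.

```python
import wave

# Assumption: the exported model expects 16 kHz input.
EXPECTED_RATE = 16000


def sample_rate_mismatch(wav_path, expected_rate=EXPECTED_RATE):
    """Return None if the WAV header matches expected_rate, else a warning string."""
    with wave.open(wav_path, "rb") as wav:
        actual = wav.getframerate()
    if actual == expected_rate:
        return None
    return f"{wav_path}: file is {actual} Hz, model expects {expected_rate} Hz"
```

If this returns a message for any of your test files, the mismatch would be the first thing to fix.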
Thanks a lot for the quick response. But the sampling rate is 16 kHz, and the scorer files and everything else are in place (double-checked that too).
Moving to 0.9.2 seems like a good idea, but as I work on a remote server without admin access, setting up the env takes a little more time and effort. I used the following command to test inference on the exported model.
I took a few random samples from the test files and tested them with the original deepspeech model and scorer. The issue was the same. I tried some different combinations and files outside the dataset. It seems there is some issue with the domain-specific audio files. I manually listened to some of them and the content was there. I checked the sampling rate and channels and found them to be 16 kHz and 1 respectively. What else could go wrong with the audio file?
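Beyond rate and channel count, a few other header and signal properties may be worth checking. The sketch below is only a guess at possible culprits (sample width other than 16-bit, near-silent audio, clipping), not a confirmed cause of this issue. The function name and the thresholds are hypothetical. Note that wave.open raises wave.Error for files that are not plain PCM, which is itself a useful signal.

```python
import array
import sys
import wave


def inspect_wav(wav_path):
    """Collect basic header facts and flag a few possible problems."""
    with wave.open(wav_path, "rb") as wav:
        channels = wav.getnchannels()
        width = wav.getsampwidth()
        rate = wav.getframerate()
        n_frames = wav.getnframes()
        raw = wav.readframes(n_frames)

    report = {
        "channels": channels,
        "sample_width_bytes": width,
        "sample_rate": rate,
        "duration_s": n_frames / rate if rate else 0.0,
        "peak": None,
        "problems": [],
    }
    if width != 2:
        # The amplitude checks below assume 16-bit samples.
        report["problems"].append("not 16-bit PCM")
        return report

    samples = array.array("h")
    samples.frombytes(raw)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV data is little-endian

    peak = max((abs(s) for s in samples), default=0)
    report["peak"] = peak
    if peak < 330:  # hypothetical threshold, roughly 1% of full scale
        report["problems"].append("near-silent")
    if peak >= 32767:
        report["problems"].append("possible clipping")
    return report
```

Running this over a handful of the failing files and a handful of files that transcribe well, then comparing the reports, might show what differs between the two groups.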
Again, there is no difference between checkpoint and pb model. So you must have changed something else. Same server, same environment, same DS version? Somewhere along the road you changed something. Why don’t you set up a fresh environment and check the versions, so you are sure you got 0.7.4? Pip install will give you 0.9.x if you don’t watch out.
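To check which client version actually ended up in the environment, something like the following could help. It is a sketch using importlib.metadata from the standard library (Python 3.8+); the helper names are hypothetical, and comparing only major.minor is my own simplification.

```python
from importlib import metadata


def installed_version(package="deepspeech"):
    """Return the installed version string of a pip package, or None if absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def same_release_line(installed, expected):
    """Compare major.minor only, e.g. 0.7.x against 0.7.4."""
    if installed is None:
        return False
    return installed.split(".")[:2] == expected.split(".")[:2]


if __name__ == "__main__":
    found = installed_version()
    # Assumption: the model was trained and exported with 0.7.4.
    print("installed:", found, "| matches 0.7.x:", same_release_line(found, "0.7.4"))
```

This only reports the pip-installed inference package; the version of the training code checkout has to be checked separately.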