For the record, we’d be more than happy to share the work we did on this. Honestly, it’s not very much though, just adding a few bits here and there and I think the only reason it hasn’t landed back into the Mozilla repo as a PR is that we’re still working on sorting out a clean PR against an up to date version. (Because DeepSpeech itself keeps moving ! ) Some other changes we made have made it back into the Mozilla master though which is nice.
As far as the timing changes go, the first time we made changes it was at the 0.5.1 version which you can see here. That worked OK, but then the whole way that scoring worked changed so we had to redo for 0.6 which is this version I think. The 0.5.1 version got a bit more in depth testing but either one should work.
I’ve recently been working on re-applying these changes for the 0.7.1 version of DeepSpeech which you can see here unfortunately the way the scoring model worked has changed quite a bit and it’s not really working at this point. So I wouldn’t recommend trying to use the 0.7 version of the changes - its still got problems. I created this this post yesterday to ask about those problems. But feel free to use the older vesrions if that would help. Bear in mind that models trained on versions where x is different (in the 0.x.y formatting) tend to not be compatible with each other.
Would welcome your input and help with that. Experimenting with it yourself would be great for example. That said very very far from prime time.
I think it would be fair to say we definitely intend to produce a clean PR and land these changes back to Mozilla repo but it may take some months before that happens. Also, as I mentioned they’re honestly not that much to write home about at this point.
Hopefully, we can collaborate on making this happen but either way, I hope you’ll be able to get letter confidences in future iterations of DeepSpeech.