Hi! In short I would like to run the CTC decoder on android with custom logits. I have model of my own which produces logits which should be then processed with CTC decoder. I would like to use deepspeech ctc decoder with scorer.
I went over documentation for android support and Java API. I also tested libdeepspeech from JCenter. It seems that the current API doesn’t provide such option. It only runs the whole process recognition + decoding. I hope that it won’t be that hard to add some binding for the CTC decoder since the API already provides functions as providing own scorer and setting beam width.
Could somebody please guide me to place where I should make appropriate changes to be able to call CTC decoder separately in Java (Android)?
Thank You in Advance
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
2
Sorry, but providing bindings for the CTC decoder is not really something we want to support.
Yes, because that’s the purpose of our API. The ctc decoding in itself is a technical detail of implementation.
The ctc decoder is well separated in native_client/ctcdecoder, so you should be able to directly call that from java through JNI.
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
3
Hi! I am facing exactly the same issue. Did you have any success running the ctc decoder separately in Android ? Any hints would be highly appreciated.