I am wondering whether it is possible to extract words from the streaming processor as they are produced. Assuming the words are processed in real time (or close to it), can they be retrieved without ending the stream? At the moment the program only outputs the transcribed speech when model.finishStream() is called.
Thanks!
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
Note that DS_IntermediateDecode is currently very expensive, so you can’t keep calling it indefinitely. You need a voice activity detection module to find silence points and call DS_FinishStream() there, so that the stream doesn’t go on for too long. Making DS_IntermediateDecode fast requires a streaming decoder, which is on the backlog, but I haven’t had time to work on it. If anyone wants to work on this, I can give guidance.
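To make the silence-point approach concrete, here is a minimal sketch of segmenting audio and finishing the stream at each pause. Everything in it is an assumption for illustration: the `Stream` interface, `is_silence`, `transcribe_segments` and `CountingStream` are hypothetical names, the energy threshold is a crude stand-in for a real voice activity detection module, and the real bindings' stream calls are not used here because their names vary between releases.

```python
from array import array
from typing import Callable, Iterable, Iterator, Optional, Protocol


class Stream(Protocol):
    """Hypothetical minimal stream interface, not the real DeepSpeech binding."""

    def feed(self, samples: array) -> None: ...

    def finish(self) -> str: ...


def is_silence(frame: array, threshold: float = 500.0) -> bool:
    """Crude mean-amplitude check standing in for a real VAD module."""
    if not frame:
        return True
    return sum(abs(s) for s in frame) / len(frame) < threshold


def transcribe_segments(
    frames: Iterable[array],
    new_stream: Callable[[], Stream],
    silence_frames: int = 3,
) -> Iterator[str]:
    """Yield one transcript per utterance, finishing the stream at each pause."""
    stream: Optional[Stream] = None
    quiet = 0
    for frame in frames:
        if is_silence(frame):
            if stream is None:
                continue  # leading silence: nothing to decode yet
            stream.feed(frame)
            quiet += 1
            if quiet >= silence_frames:
                # Silence point reached: close this stream so it stays short.
                yield stream.finish()
                stream, quiet = None, 0
        else:
            if stream is None:
                stream = new_stream()  # start a fresh stream for a new utterance
            stream.feed(frame)
            quiet = 0
    if stream is not None:
        yield stream.finish()  # flush whatever is left when the audio ends


class CountingStream:
    """Stand-in stream for the demo: reports how many frames it was fed."""

    def __init__(self) -> None:
        self.count = 0

    def feed(self, samples: array) -> None:
        self.count += 1

    def finish(self) -> str:
        return f"{self.count} frames"


if __name__ == "__main__":
    loud = array("h", [1000] * 4)
    quiet = array("h", [0] * 4)
    audio = [quiet, loud, loud, quiet, quiet, quiet, loud, quiet]
    for text in transcribe_segments(audio, CountingStream):
        print(text)
```

In a real setup, `new_stream` would presumably wrap whatever stream-creation call your version of the bindings exposes, and `finish` would wrap the finishStream call mentioned above. Words then arrive once per pause rather than only at the very end, without repeated intermediate decodes.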