I’m using Windows, so I don’t believe I can use the deepspeech package, so I downloaded the pretrained model and have loaded it in my script using keras. My question is how do I go about using the model, like what type of input is it expecting, how should audio be preprocessed, and what kind of output does the model give. I don’t quite understand how the output graph works either.
Apologies if this seems like a basic problem.
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
2
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
3
I would suggest to just have a look at how feeding is performed in native_client/deepspeech.cc as well as the export() function in DeepSpeech.py, that should help you to understand?
@jafioti Just notice with the current nuget you can’t use the Interface with the using (), use the class instead, it was a change that is not in the package yet
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
8
It’s in master, but I’m still working on TaskCluster builds for Windows, it will be at best a few days before we can push any alpha to NuGet.org …
@jafioti Not working at which point? Installing it, running it? CPU or GPU? What’s working? At the time you deleted I was reading. Make sure you are running the project under x64 CPU configuration.