I cannot get a binary compile with 0.8.1
- Training or just running inference
Initially, inference (when I can get a compile) - Moziall STT branch/version
When I go the “git clone” and submodule tensorflow, the VERSION file tells me
9-alpha something. I went to 0.8.1 release, downloaded the zip and followed
the tensorflow from there to retrieve a zip. - OS Platform and Distribution (e.g., Linux Ubuntu 18.04)
Linux - openSuSE Tumbleweed - Python version
$ python --version
Python 3.8.5 - TensorFlow version
The version retrieved by the “git submodule tensorflow” or the one referenced
by the 0.8.1 release github page (I am quite sure that I haven’t crossed over
since I do a complete purge (rm -rf STT-0.8.1 and similar for git clone)).
I haven’t found older threads that seem to address the binary build topic.
virtualenv -p python3 $HOME/tmp/deepspeech-venv/
source $HOME/tmp/deepspeech-venv/bin/activate
pip3 install deepspeech
curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.8.1/deepspeech-0.8.1-models.pbmm
curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.8.1/deepspeech-0.8.1-models.scorer
curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.8.1/audio-0.8.1.tar.gz
tar xvf audio-0.8.1.tar.gz
deepspeech --model deepspeech-0.8.1-models.pbmm --scorer deepspeech-0.8.1-models.scorer --audio audio/2830-3980-0043.wav
… and it produces a good transcript of a 1-minute clip when I change out the audio-file.
I am not familiar with github nor bazel but I have been programming since the 1980’s both professionally and for my own curiosity. My interest in deepspeech is to rough transcribe several hundred, if not a couple of thousand, hours of a radio broadcast and deepspeech has come closest in coherence and accuracy of the systems I have tried.
I have also downloaded the binary native-client package but cannot get a compile of client.cc against this. My goal is to be able to write C code against the library such that I can automate my task. C-code for speed, as I mentioned I have a lot of audio to scan thru.
Thanks for any insight you can give me.