I was trying to execute DeepSpeech.py with "--one_shot_infer" on a Raspberry Pi before using the native_client deepspeech binary.
In native_client, n_mfcc is not a parameter but is fixed at 26, while my models all use 13; the sample_rate is fixed as well, and I am using 8 kHz.
So I wanted a sneak peek at the performance as a prototype, before being forced to modify and re-compile native_client again for the newer versions, and that is when it failed.
TensorFlow now delivers (since v1.9) wheel packages for Raspberry Pi, which lets us use Python for fast inference prototyping without the recompiling pain, but the libctc_decoder_with_kenlm.so library is not available in taskcluster for ARM.
Could it be made available?
Best regards,
Mar
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
Given the current performance for just inference on the RPi3, supporting that would just be a waste of time. Even if there are wheels available, it's not usable in this case, because it's way too slow. And libctc_decoder_with_kenlm.so is going away soon.
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
I don't get your point: you can just rebuild the native client, you don't need to rebuild everything, if that's all you want to change. Just hack native_client/client.cc and follow the docs to rebuild; you have everything required to link in native_client.tar.xz.
So… one small question: is the way the trie is made going to change again, or only the usage code? I mean, will I be able to use the same .trie file from v0.2, or will I have to regenerate it? And will it affect the trained models I have now?
BR
It is not exactly that way; look at the beginning of the deepspeech.cc file:
//TODO: use dynamic sample rate
const unsigned int SAMPLE_RATE = 16000;
const float AUDIO_WIN_LEN = 0.025f;
const float AUDIO_WIN_STEP = 0.01f;
const unsigned int AUDIO_WIN_LEN_SAMPLES = (unsigned int)(AUDIO_WIN_LEN * SAMPLE_RATE);
const unsigned int AUDIO_WIN_STEP_SAMPLES = (unsigned int)(AUDIO_WIN_STEP * SAMPLE_RATE);
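For reference, here is a minimal sketch of what I would need to patch for my 8 kHz, 13-MFCC setup. The names above come straight from deepspeech.cc; MFCC_FEATURES is my assumption for where the 26-coefficient count lives:
// Hedged sketch: the same constants adjusted for an 8 kHz model.
// The 25 ms / 10 ms windows are unchanged; only the sample rate differs,
// so the derived per-window sample counts shrink accordingly.
const unsigned int SAMPLE_RATE = 8000; // was 16000
const float AUDIO_WIN_LEN = 0.025f;
const float AUDIO_WIN_STEP = 0.01f;
const unsigned int AUDIO_WIN_LEN_SAMPLES = (unsigned int)(AUDIO_WIN_LEN * SAMPLE_RATE); // now 200 samples
const unsigned int AUDIO_WIN_STEP_SAMPLES = (unsigned int)(AUDIO_WIN_STEP * SAMPLE_RATE); // now 80 samples
// Assumed name for the feature count (26 upstream):
// const unsigned int MFCC_FEATURES = 13;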
Best Regards,
Mar
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
It's being completely replaced, and the trie file will change.
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
Ok, you changed the model, so it makes sense. May I ask why you still want to train on the RPi3? It's much, much faster to train on a desktop GPU.
No, I am not training on an RPi3, but on Ubuntu or a Mac, with a desktop GPU.
The RPi3 (or, better, the Asus Tinkerboard) is intended to be the inference platform where the trained (tiny) model will be executed.
I was just using Python inference as a first prototype (before recompiling native_client), which is when I noticed libctc_decoder_with_kenlm.so was missing from taskcluster.
Thanks,
Mar
lissyx
((slow to reply) [NOT PROVIDING SUPPORT])
Ok, that makes sense. Cross-compiling it should not be very hard, nor take very long; that's the best I can suggest for the moment.