The README mentions that if we want to build our own binaries, one of the prerequisites is installing Mozilla’s TensorFlow r1.15 branch, since it fixes some common problems.
Can you explain, in general, what problems were encountered, i.e. why the fork of TensorFlow was needed?
lissyx
First, we need the fork for some CI integration. We also have a few extra fixes that we apply until upstream picks them up, which can take time. And we have some more specific improvements, namely for cross-compilation.
Upstreaming them is always good, but it does not always work out …
No, you only need our fork to build the code; training a model does not require it, you can just use upstream (we do) …
Was one of the reasons for using a custom TensorFlow to use KenLM with the CTC beam search decoder? Or is it possible to use KenLM with the CTC decoder using upstream TensorFlow?
lissyx
I don’t think we had to do funny things because of the CTC decoder. Could you please elaborate on why you are asking?
I have modified the DeepSpeech architecture a little and I am trying to recompile the binaries to match the modified model so I can run inference on it. There isn’t any documentation available for the ds_ctcdecoder package, so it is somewhat difficult to understand the process. That is why I was asking.
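For context, this is roughly how the 0.6-era DeepSpeech training code (evaluate.py) drives ds_ctcdecoder; treat it as a sketch rather than reference documentation, since the exact signatures and file names (lm.binary, trie) vary between releases:

```python
import numpy as np
from util.text import Alphabet  # helper module from the DeepSpeech repo
from ds_ctcdecoder import Scorer, ctc_beam_search_decoder_batch

alphabet = Alphabet('data/alphabet.txt')

# Dummy acoustic output: softmaxed character probabilities, shape (N, T, 29)
probs = np.random.rand(1, 100, 29).astype(np.float32)
seq_lengths = np.array([100], dtype=np.int32)

# KenLM scorer: lm_alpha, lm_beta, language model binary, trie, alphabet
scorer = Scorer(0.75, 1.85, 'lm.binary', 'trie', alphabet)

# Batched beam search: beam width 500, 4 worker processes
decoded = ctc_beam_search_decoder_batch(probs, seq_lengths, alphabet, 500,
                                        num_processes=4, scorer=scorer)
print(decoded[0][0][1])  # top hypothesis for the first sample
```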
lissyx
What kind of changes have you made that require you to rebuild everything?
Please be more specific; there is documentation on how to rebuild that, so I don’t understand your statement.
I added 2 convolutional layers at the beginning of the model. Since the model architecture has changed, will the same binaries work?
By documentation I meant an explanation of how the CTC decoder works behind the scenes.
And if I am building a completely different speech-to-text model, these binaries won’t work, right, because they are built for DeepSpeech? In that case I guess I would have to modify those binaries. Please correct me if I am wrong.
lissyx
That depends on how you performed those changes. Did you modify the user-facing input? Could you share a diff?
We don’t change how the CTC decoder works, we just provide scoring.
I can’t do divination, so please show me the code.
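To illustrate that point: the beam search itself is standard, and the language model only contributes an extra score at word boundaries. A minimal conceptual sketch (not the actual ds_ctcdecoder internals; the lm dict, alpha, and beta here are stand-ins):

```python
import math

def lm_score(word, lm, alpha=0.75, beta=1.85):
    """External scoring: alpha-weighted LM log-probability plus a
    word-insertion bonus beta (hypothetical default values)."""
    return alpha * math.log(lm.get(word, 1e-9)) + beta

def extend_beam(prefix, char, acoustic_logprob, lm):
    """Extend one beam hypothesis by one character; the decoder only
    consults the LM when a word boundary (space) is emitted."""
    score = acoustic_logprob
    if char == ' ' and prefix:
        score += lm_score(prefix.split()[-1], lm)
    return prefix + char, score

# Toy usage: a dict as a stand-in unigram "language model"
lm = {'hello': 0.1, 'world': 0.05}
print(extend_beam('hello', ' ', -1.2, lm))
```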
No, the user-facing input is still the MFCC vectors; after the input, I apply the convolutional layers to it. The output is the character probabilities for each time-step (same as DeepSpeech).
I’m sorry, I haven’t put the code on git yet.
I was referring to the documentation of this package.
Input (audio_input): 26-dimensional MFCC vectors, i.e. (batch_size, time_steps, 26)
Output (prediction): character probabilities for each time-step, i.e. (batch_size, time_steps, 29)
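Something like the following is presumably what the change looks like; a hypothetical sketch in TensorFlow 1.15 graph style, with filter counts and kernel sizes invented for illustration:

```python
import tensorflow as tf  # TensorFlow 1.15

n_input, n_chars = 26, 29

# User-facing input unchanged: (batch_size, time_steps, 26) MFCC vectors
audio_input = tf.placeholder(tf.float32, [None, None, n_input],
                             name='audio_input')

# Two 1-D convolutions over time; 'same' padding keeps time_steps intact,
# so the external shapes stay (N, T, 26) in and (N, T, 29) out
conv1 = tf.layers.conv1d(audio_input, filters=128, kernel_size=5,
                         padding='same', activation=tf.nn.relu)
conv2 = tf.layers.conv1d(conv1, filters=128, kernel_size=5,
                         padding='same', activation=tf.nn.relu)

# ... the rest of the acoustic model (RNN, dense layers) goes here ...
logits = tf.layers.dense(conv2, n_chars)
prediction = tf.nn.softmax(logits, name='prediction')
```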
lissyx
If you made no changes to the input or output, then you should be able to use the binaries we provide.
You could share that here; it’s hard to ensure compatibility this way.
I still don’t understand your question.
I’m unsure we have the TensorFlow ops for the convolutions in the binaries, though. Maybe the TFLite runtime would work out of the box. You should just try both first, before trying to rebuild …
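One cheap way to run that test is to attempt a TFLite conversion of the modified graph and see whether it complains about unsupported ops. A sketch assuming TensorFlow 1.15; the file and node names (output_graph.pb, audio_input, prediction) are the hypothetical ones from the model above:

```python
import tensorflow as tf  # TensorFlow 1.15

converter = tf.lite.TFLiteConverter.from_frozen_graph(
    'output_graph.pb',                 # frozen inference graph
    input_arrays=['audio_input'],
    output_arrays=['prediction'],
    # TFLite wants fixed shapes; e.g. 1 sample, 16 time-steps, 26 features
    input_shapes={'audio_input': [1, 16, 26]})

tflite_model = converter.convert()     # raises if an op is unsupported
with open('output_graph.tflite', 'wb') as f:
    f.write(tflite_model)
```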
Alright, understood. I just wanted to use KenLM with the beam search decoder in my new model, and to use the generate_trie binary with it.
I guess it could work if the model input and output are the same. Thanks a lot, this was a huge help.
lissyx
Without looking at the code I can’t be definitive, but that seems to be the case, and the only limitation, as I said, might be the TensorFlow runtime; since we now have TFLite available everywhere, you can try easily. Look for 0.7.0-alpha.0: it is going to stay compatible with 0.6.1 for the moment, and it provides TFLite on every platform.
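For completeness, trying the prebuilt runtime is only a few lines with the deepspeech Python package; a sketch assuming the 0.6-era API (the Model and enableDecoderWithLM signatures changed across releases, so check the docs for the version you install):

```python
import numpy as np
from deepspeech import Model

ds = Model('output_graph.tflite', 500)       # model path, beam width
ds.enableDecoderWithLM('lm.binary', 'trie',  # KenLM binary + trie
                       0.75, 1.85)           # lm_alpha, lm_beta

# 16-bit, 16 kHz mono PCM samples
audio = np.frombuffer(open('audio.raw', 'rb').read(), dtype=np.int16)
print(ds.stt(audio))
```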