Requirements for DeepSpeech

What are the minimum platform requirements, both hardware and software, to download and run the DeepSpeech code? For example: hardware, OS, Python, TensorFlow versions, etc.

This is documented in the README, in the very first section, though it’s a bit outdated now: the figures are valid for the older versions (the 0.1.1 model), and newer ones should require even less power: https://github.com/mozilla/DeepSpeech/blob/master/README.md

Thanks! But under “Table of Contents” and “Prerequisites” it only says Python and Git Large File Storage. It does not say anything about the other requirements.

Check just above that part; it gives figures for some hardware :slight_smile:

It says “please check runtime dependencies”. That link does not give OS, hardware, TensorFlow version, etc.

Giving CPU requirements is much more complicated, because even with the same CPU model we saw big variance depending on a lot of factors.

Thanks. If I use the CPU model and do not use GPUs, what hardware and OS do I need, for example?

As documented, if you use our prebuilt binaries, you need a CPU with at least the AVX instruction set. Also, as documented, we have binaries for Linux/AMD64, OSX/AMD64, and some ARM (strictly RPi3B) and ARM64 systems (they should run on any Debian Stretch aarch64 distro; tested on a Le Potato board).
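For example (just a quick sanity check, not official documentation), on Linux you can verify that your CPU exposes AVX by looking at /proc/cpuinfo:

# Should print avx (and possibly avx2) if the CPU supports it; empty output means the prebuilt binaries will not run
grep -o 'avx[^ ]*' /proc/cpuinfo | sort -u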

Thanks again. After I have the hardware, say Linux hardware, and after some installations and steps, if I do
git clone https://github.com/mozilla/DeepSpeech

to get the code and then run
run-ldc93s1.sh

it should work?

Does “prebuilt binaries” mean pre-trained binaries?

@csawkar1215 It would have been easier if you had stated what you want. If you are looking at training your own model, that’s not the same thing; you require some good GPUs to be able to achieve anything.

No, it means pre-built binaries, to run inference.

It’s all documented: https://github.com/mozilla/DeepSpeech/blob/master/README.md#training
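Roughly, the training flow documented there looks like the sketch below (paths and the requirements file may differ between releases, so treat this as an illustration rather than exact instructions):

git clone https://github.com/mozilla/DeepSpeech
cd DeepSpeech
pip install -r requirements.txt   # pulls in TensorFlow and the other Python dependencies
./bin/run-ldc93s1.sh              # trains a toy model on the single-sentence LDC93S1 sample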

Thank you for the details.

I’m sorry, but without more details on what you are trying to do, it’s hard to be more helpful. Full training of the previous 0.1.1 model on the whole set of data we have (several thousand hours of English audio) on something like 16x TITAN X GPUs would take around one week.

What does “inference with prebuilt binaries” mean?

Binaries that convert audio to text.

What do you feed to the binary, and what is the output?

Again, it’s all documented: 16-bit WAV in, text out.
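For example, with the prebuilt Python package the whole round trip is roughly the following (the exact flags and model file name depend on the release you download, so this is only a sketch):

pip install deepspeech                                   # prebuilt CPU inference package
deepspeech --model output_graph.pbmm --audio audio.wav   # feed a 16-bit WAV, the transcript is printed to stdout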