DeepSpeech inference with multiple GPUs

Hello,
I am currently trying to run inference on a large number of files using a trained model. During inference, the DeepSpeech Python client uses only one GPU out of three. How can I extend it to use all of them in parallel?

We don’t have support for batch inference in the library currently; your best option is evaluate.py / transcribe.py.

The inference process is bottlenecked by the decoder, which is CPU-only. Using all GPUs won’t gain you much performance, which is why evaluate.py and transcribe.py only use a single GPU.
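That said, if you want to keep all three GPUs busy anyway, one workaround is to split the file list into shards and run one inference process per GPU, pinned via CUDA_VISIBLE_DEVICES; this also spreads the CPU-bound decoding across cores. This is just a minimal sketch, not a library feature: it assumes the deepspeech-gpu Python package with the 0.7+ API (Model, enableExternalScorer, stt), and the model/scorer filenames and file list are placeholders you would replace with your own.

```python
import os
import wave
import numpy as np
from multiprocessing import get_context

GPUS = ["0", "1", "2"]                      # one worker per GPU
MODEL = "deepspeech-0.9.3-models.pbmm"      # assumed model filename
SCORER = "deepspeech-0.9.3-models.scorer"   # assumed scorer filename
AUDIO_FILES = ["clip1.wav", "clip2.wav"]    # replace with your file list

def transcribe_shard(gpu_id, files):
    # Pin this worker to one GPU before TensorFlow is initialised
    os.environ["CUDA_VISIBLE_DEVICES"] = gpu_id
    from deepspeech import Model  # import only after setting the env var

    ds = Model(MODEL)
    ds.enableExternalScorer(SCORER)
    results = []
    for path in files:
        with wave.open(path, "rb") as w:
            audio = np.frombuffer(w.readframes(w.getnframes()), dtype=np.int16)
        results.append((path, ds.stt(audio)))
    return results

if __name__ == "__main__":
    # Round-robin the files into one shard per GPU
    shards = [(gpu, AUDIO_FILES[i::len(GPUS)]) for i, gpu in enumerate(GPUS)]
    # "spawn" keeps each worker's TensorFlow state independent of the parent
    with get_context("spawn").Pool(len(GPUS)) as pool:
        for shard in pool.starmap(transcribe_shard, shards):
            for path, transcript in shard:
                print(path, transcript)
```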

Is there any way of getting the metadata with evaluate.py or transcribe.py?

There are always ways, but this code lives in libdeepspeech, so it would be quite some work.

The full metadata is already returned by the bindings; it’s just reduced in native_client/ctcdecode/__init__.py to (confidence, transcript) tuples. You should be able to edit that file to expose that info to Python.
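For reference, here is the shape of that change as a small self-contained sketch. The BeamResult stand-in and its .confidence / .tokens / .timesteps attributes are assumptions for whatever the SWIG wrapper actually returns in your checkout; the point is only that the final list comprehension is where the metadata gets dropped, so that is the line to change.

```python
from dataclasses import dataclass
from typing import List

# Stand-in for the per-beam object the SWIG wrapper returns; the real
# attribute names may differ, check the wrapper class in your checkout.
@dataclass
class BeamResult:
    confidence: float
    tokens: List[int]     # label indices into the alphabet
    timesteps: List[int]  # frame index where each token was emitted

def decode_tokens(tokens: List[int]) -> str:
    # Placeholder for alphabet.Decode(): maps label indices to characters
    alphabet = "abcdefghijklmnopqrstuvwxyz '"
    return "".join(alphabet[t] for t in tokens)

def postprocess(beam_results: List[BeamResult]):
    # What native_client/ctcdecode/__init__.py does today (roughly):
    # return [(res.confidence, decode_tokens(res.tokens)) for res in beam_results]

    # Keeping the timing metadata is just a matter of not dropping it here:
    return [
        (res.confidence, decode_tokens(res.tokens), list(res.timesteps))
        for res in beam_results
    ]

if __name__ == "__main__":
    fake = [BeamResult(-4.2, [7, 8], [12, 19])]  # illustrative values only
    print(postprocess(fake))  # [(-4.2, 'hi', [12, 19])]
```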
