GPU requirements: half (FP16), single (FP32) or double (FP64) precision floating point for calculations?

Hi,

Does training DeepSpeech require half (FP16), single (FP32), or double (FP64) precision floating point for its calculations?

I’m looking at price/performance for GPU cards. Any specific recommendations?

Tan

As for the exact use of floating-point precision during training, it’s not something we have looked at in detail, and it may depend on TensorFlow and/or CUDA behavior, so we don’t have any reliable information on that.

For sure, the faster, the better: we train on some TITAN X GPUs. I guess you want to max out performance for your budget, but it would still be useful to know what kind of workload you expect, and to get a rough idea of your budget.

I guess there should be some way to get more instrumentation from TensorFlow. Maybe @reuben or @kdavis has some insight?
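
For instance, a minimal sketch (assuming the TF 1.x graph API, not anything specific in our training code) of dumping the dtypes that the ops in a built graph actually produce, to check whether anything runs below FP32:

```python
import tensorflow as tf

# Sketch only: after the training graph has been built, walk its operations
# and print the floating-point dtype of each output tensor.
graph = tf.get_default_graph()
for op in graph.get_operations():
    for t in op.outputs:
        if t.dtype in (tf.float16, tf.float32, tf.float64):
            print(op.name, t.dtype.name)
```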

We train at FP32 and, as far as I know, have never tried FP16 or FP64 training.

As to GPU recommendations, it depends. I’d suggest taking a look at Tim Dettmers’ Which GPU(s) to Get for Deep Learning as a start.

Yes, that’s what I was going to comment on: the few places where we explicitly set a floating-point precision are around the RNN implementation, and it’s FP32. Still, I’m not sure precisely what happens below that, in TensorFlow and even in CUDA: maybe the computation is done, at some level, at a different precision? See the sketch below for what I mean by "explicitly set".
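
Here is a minimal, hypothetical illustration (not our actual code) of the kind of place where a precision gets pinned, using the TF 1.x RNN API with an explicit FP32 dtype:

```python
import tensorflow as tf

# Sketch only: a recurrent layer built with an explicit FP32 dtype.
# Shapes and sizes are made up for illustration.
inputs = tf.placeholder(tf.float32, shape=[None, None, 26])  # [batch, time, features]
cell = tf.nn.rnn_cell.LSTMCell(num_units=2048)
outputs, state = tf.nn.dynamic_rnn(cell, inputs, dtype=tf.float32)
```

That only covers what gets declared at the graph level; whatever cuDNN/cuBLAS do internally is exactly the part that is hard to see from the Python side.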

No, not unless you explicitly opt in. (It often requires additional changes to the training code.)
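
For reference, a hedged sketch of what explicitly opting in can look like with TF 1.14+ automatic mixed precision (an illustration of the mechanism, not something our training code does): the optimizer is wrapped so that eligible ops are rewritten to FP16 and loss scaling is added automatically.

```python
import tensorflow as tf

# Sketch only: opt into automatic mixed precision by wrapping the optimizer.
# The learning rate here is a placeholder for illustration.
optimizer = tf.train.AdamOptimizer(learning_rate=1e-4)
optimizer = tf.train.experimental.enable_mixed_precision_graph_rewrite(optimizer)
# The rest of the training loop then calls optimizer.minimize(loss) as usual.
```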