Batch Size for DeepSpeech 0.4.1

Is there a recommended batch size for DeepSpeech 0.4.1?

I’m trying to fine-tune the pre-built checkpoint on a dataset of 30,000 audio files. I’m currently trying a batch size of 50, but would that be too big?

Learning rate: 0.0001

Unrelated question: is it proper to run multiple epochs on the same dataset?

Batch size really depends on your hardware, more specifically on the amount of GPU memory you have. Generally, if a batch of that size fits in your GPU memory, try it.

Running multiple epochs on a single data set is proper. However, having a dev set to monitor for overfitting is also advised.
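To make the overfitting point concrete, here is a minimal sketch (plain Python, not DeepSpeech’s own early-stopping code) of what monitoring the dev set looks like: keep running epochs while the dev loss improves, and stop once it stalls or starts rising. The function name and the loss numbers are made up for illustration.

```python
# Hypothetical early-stopping check: stop once the dev loss has not improved
# for `patience` consecutive epochs. This is NOT DeepSpeech's own logic,
# just an illustration of why a dev set helps when running many epochs.
def should_stop(dev_losses, patience=3):
    if len(dev_losses) <= patience:
        return False
    best_before = min(dev_losses[:-patience])
    # None of the last `patience` epochs beat the earlier best -> stop.
    return min(dev_losses[-patience:]) >= best_before

# Made-up dev losses: improving for a few epochs, then creeping back up.
history = []
for epoch, dev_loss in enumerate([42.1, 35.6, 31.2, 30.8, 31.0, 31.5, 32.3]):
    history.append(dev_loss)
    if should_stop(history):
        print("stop after epoch", epoch, "- dev loss is no longer improving")
        break
```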

Batch size does have an impact on final model accuracy though, as discussed e.g. here.

A batch size of 1 tends to produce the most accurate model, but training takes the longest time to complete.

This is also consistent with my experiments with DeepSpeech using different batch sizes on the same data: smaller batch sizes yield, on average, lower loss on the test data.

Thanks guys! That answers my question.

@kdavis
How large a dev set do you recommend?
If I have 1,000 samples and a batch size of 100 in my training data set, how many samples should be in the dev set, and what batch size should it use?
Thank you.

I recommend having a dev set that’s a “statistically sound” sample when compared to the size of your training set.

To calculate how many clips to use in a dev set I use the sample size calculator with a population size equal to the number of training clips, a confidence level of 99%, and a margin of error of 1%. For example, for 2 million training clips this gives a dev set size of 16504 clips.

For smaller training sets it’s much harder to get a statistically sound sample, because the dev set size and the training set size end up being almost equal. For example, for 1,000 training clips the dev set size should be 944.
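If you want to script this instead of using an online calculator, the usual finite-population sample-size formula (Cochran’s formula with a finite-population correction) reproduces both numbers above, assuming z ≈ 2.58 for a 99% confidence level, a 1% margin of error, and the worst-case proportion p = 0.5. The exact calculator kdavis used may differ slightly, so treat this as a sketch.

```python
import math

def dev_set_size(num_training_clips, z=2.58, margin=0.01, p=0.5):
    # Cochran's formula for an effectively infinite population...
    n0 = (z ** 2) * p * (1 - p) / margin ** 2
    # ...then the finite-population correction for the actual number of clips.
    n = n0 / (1 + (n0 - 1) / num_training_clips)
    return math.ceil(n)

print(dev_set_size(2_000_000))  # 16504 clips
print(dev_set_size(1_000))      # 944 clips
```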

I am doing exactly the opposite: I am using 950 samples for training and 50 for dev.
Should I increase the dev set to 200, given that I don’t have that much data?