Loss function appears to slowly climb over batches during epoch, reset every epoch

utunga · June 13, 2018, 12:59am

Hey ya’ll,

Any help understanding this would be much appreciated.

There is a tensorflow variable called ‘loss’ which is already defined in the train() method (of DeepSpeech.py). Not suprisingly really since it is what is passed to the gradient optimizer.

I added it to my tensorboard so I could see it’s progress while training DeepSpeech on a new language.

Over the first two and a bit epochs of training the loss function looks like this:

As you can see it appears to consistently go up during the epoch not down as I would’ve expected.

Over many epochs it does what you would want it to do - track down …

But I’m wondering if any one can help explain why the loss goes up from batch to batch.

It isn’t just a question of needing to divide the loss by the batch count in order to get the ‘average’ loss per batch because - well you can see from the numbers involved… it starts at 100 then goes up to only 300 over many batches (I think about ~50 batches per epoch in this case) so its not just a sum of the loss over all batches?

I assume I am missing something super obvious here but would love to know what the story is from anyone who does know.

Thanks!

kdavis · June 13, 2018, 3:22am

DeepSpeech uses curriculum learning, an epoch starts with easier, short sentences and ends with harder, longer ones. Thus, the loss is lowest on the easy, short examples and higher on the hard, long examples.

utunga · June 13, 2018, 3:25am

oh that’s very helpful - thanks

Topic		Replies	Views
Training loss increases for each epoch DeepSpeech	2	1077	November 6, 2019
Training Loss vs Test Loss DeepSpeech	8	2577	August 26, 2019
Trainig model loss DeepSpeech	27	1144	March 13, 2020
Training of Epoch x - loss: inf. again decrease LR, stil same issue repeating DeepSpeech	10	827	October 18, 2018
Deepspeech model DeepSpeech	4	948	September 24, 2019

Loss function appears to slowly climb over batches during epoch, reset every epoch

Related topics