Hi. I am trying to make a toy model with ~ 600 commands(25 minutes). Hyperparameters are below:
- Hidden layer 64
- Dropout rate 0.3
After some epochs loss goings up. And the difference between train and validation losses is big.
But loss is too high.
How can I make an accurate model?