
PleaseKillMeNowOkay OP t1_iqscxo9 wrote

The simpler model had lower training loss after the same number of epochs. I tried training the second model until it reached the same training loss as the first, which took much longer. The validation loss did not improve and showed a slight upward trend, which I know means it's overfitting.
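
For anyone wanting to check the same thing programmatically, here is a rough sketch of a patience-based early-stopping check on a list of per-epoch validation losses. The function name `early_stop_epoch`, the `patience` and `min_delta` parameters, and the loss values are all made up for illustration; they are not from my actual runs or from any particular library.

```python
def early_stop_epoch(val_losses, patience=3, min_delta=0.0):
    """Return (best_epoch, stop_epoch) for a sequence of validation losses.

    best_epoch is the index of the lowest validation loss seen so far.
    stop_epoch is the first epoch at which `patience` epochs have passed
    without an improvement of more than `min_delta`, or None if that
    never happens.
    """
    best_loss = float("inf")
    best_epoch = -1
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss - min_delta:
            # Validation loss improved: remember this epoch.
            best_loss = loss
            best_epoch = epoch
        elif epoch - best_epoch >= patience:
            # No improvement for `patience` epochs: likely overfitting.
            return best_epoch, epoch
    return best_epoch, None


# Illustrative losses only: improving at first, then drifting upward.
val_losses = [0.90, 0.70, 0.60, 0.61, 0.63, 0.66, 0.70]
print(early_stop_epoch(val_losses, patience=3))  # (2, 5)
```

With a check like this you would keep the weights from `best_epoch` instead of continuing to train until the training loss matches the simpler model.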
