Submitted by Thijs-vW t3_yta05n in deeplearning
Thijs-vW OP t1_iw6rmvh wrote
Reply to comment by scitech_boom in Update an already trained neural network on new data by Thijs-vW
>Anyhow, you cannot do this:
I do not understand why I cannot use train my already trained model on new data. Could you elaborate?
scitech_boom t1_iw6zpa9 wrote
There are multiple reasons. The main issue has to do with validation error. It usually follows a U curve, with a minimum at some epoch. This is the point at which we usually stop the training (`early stopping`). Any further training, with or without new data is only going to make the performance worse (I don't have a paper to cite for that).
I also started with the best model and that did not work. But when I took the model 2 epochs before the best model, it worked well. In my case(speech recognition), it was a nice balance between improvement and training time.
Viewing a single comment thread. View all comments