sydjashim t1_iqtuqdt wrote
Reply to comment by thebear96 in Neural network that models a probability distribution by PleaseKillMeNowOkay
Can you reason out why the model would converge more quickly?
sydjashim t1_iqtu9hp wrote
Did you keep the same initial weights for both networks?
sydjashim t1_ique162 wrote
Reply to comment by PleaseKillMeNowOkay in Neural network that models a probability distribution by PleaseKillMeNowOkay
Here's a quick guess that may be of help: take the weights of the first n-1 layers of your trained model, attach the 4-output head, fine-tune, and observe whether your validation loss improves.
If so, you can then take the untrained initial weights of your first model (up to the (n-1)th layer) and train them from scratch with 4 outputs. That way you have a model trained from scratch for 4 outputs, but both models start from the same initial weights.
Why am I suggesting this?
Because for a fair comparison between the two runs, you want to keep as many parameters as possible, especially the model weights, the same between them.
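The two-step procedure above can be sketched in PyTorch. This is a minimal illustration, not the thread's actual setup: the architecture, layer sizes, and the 2-output original head are all assumptions, since the original post doesn't specify them.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

# Hypothetical body ("first n-1 layers"); sizes are illustrative.
def make_body():
    return nn.Sequential(
        nn.Linear(8, 16), nn.ReLU(),
        nn.Linear(16, 16), nn.ReLU(),
    )

body = make_body()
# Save the untrained initial weights before any training happens.
init_state = {k: v.clone() for k, v in body.state_dict().items()}

# Original model with its (assumed) 2-output head.
model_a = nn.Sequential(body, nn.Linear(16, 2))
# ... suppose model_a gets trained here ...

# Step 1: reuse the *trained* body, attach a fresh 4-output head,
# and fine-tune while watching the validation loss.
finetune_model = nn.Sequential(body, nn.Linear(16, 4))

# Step 2: rebuild the body from the saved *untrained* weights and
# train from scratch with 4 outputs, so both runs share the same
# initialization for the common layers.
scratch_body = make_body()
scratch_body.load_state_dict(init_state)
scratch_model = nn.Sequential(scratch_body, nn.Linear(16, 4))

# Sanity check: the scratch body really matches the initial weights.
for k, v in scratch_body.state_dict().items():
    assert torch.equal(v, init_state[k])
```

Only the shared n-1 layers start identically; the two 4-output heads are independently initialized, which is unavoidable since the original model never had one.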