XtremePocket t1_ir0xpjw wrote on October 4, 2022 at 3:18 PM Reply to [D] How do you go about hyperparameter tuning when network takes a long time to train? by twocupv60 Mu transfer has (sort of) a theoretically guaranteed way of transferring the optimal hyperparameters of scaled down versions of a model to it. I haven’t tried it in practice, but maybe give that a try? Permalink 3
XtremePocket t1_ir0xpjw wrote
Reply to [D] How do you go about hyperparameter tuning when network takes a long time to train? by twocupv60
Mu transfer has (sort of) a theoretically guaranteed way of transferring the optimal hyperparameters of scaled down versions of a model to it. I haven’t tried it in practice, but maybe give that a try?