[D] How do you go about hyperparameter tuning when network takes a long time to train? Submitted by twocupv60 t3_xvem36 on October 4, 2022 at 1:05 PM in MachineLearning 65 comments 87
RandomIsAMyth t1_ir0hh8v wrote on October 4, 2022 at 1:24 PM Smaller networks is one way to go indeed. Have a similar architecture but smaller. Much smaller such that you can have a result in ~1h. Then you can just distribute the process using weights and biases or another similar framework. Permalink 2
Viewing a single comment thread. View all comments