[D] Has any research been done to counteract the fact that each training datapoint "pulls the model in a different direction", partly undoing learning until shared features emerge? Submitted by derpderp3200 t3_zwd49c on December 27, 2022 at 11:01 AM in MachineLearning 20 comments 4
big_haptun777 t1_j1ug2fb wrote on December 27, 2022 at 2:10 PM I believe that it has already been solved via shuffling and batching. You will possibly not get stuck in local minima. Permalink −3−
Viewing a single comment thread. View all comments