[D] What are the major general advances in techniques? Submitted by windoze t3_ylixp5 on November 3, 2022 at 11:50 PM in MachineLearning 26 comments 41
Gere1 t1_iv0505o wrote on November 4, 2022 at 8:41 AM Does someone know a good ablation study of the mentioned techniques. I've seen results where neither dropout nor layer normalization did much. So I wonder if these 2 techniques are a believe or still crucial. Permalink 2
Viewing a single comment thread. View all comments