Submitted by Lugi t3_xt01bk in MachineLearning
killver t1_iqo6z39 wrote
Alpha in focal loss has confused me and others before. I do not understand why they built their paper writeup so heavily around it, as it was not really the contribution of the paper.
I would suggest to use a non-alpha variant in your experiments, and only think about alpha as a common way of up/downscaling classes and add it later.
Viewing a single comment thread. View all comments