Submitted by Beautiful-Gur-9456 t3_124jfoa in MachineLearning
noraizon t1_je10328 wrote
x0-parametrization has been used for some time now. imo, nothing new under the sun. maybe there's something else I'm not seeing
Beautiful-Gur-9456 OP t1_je18sc5 wrote
You're totally right 😅 I think the true novelty here is dropping distillation and introducing a simple BYOL-like formulation. Bootstrapping always feels like magic to me.