Submitted by netw0rkf10w t3_zmpdo0 in MachineLearning
netw0rkf10w OP t1_j2939o2 wrote
Reply to comment by TimDarcet in [D] What are the strongest plain baselines for Vision Transformers on ImageNet? by netw0rkf10w
You are right, indeed. Not sure how I missed that. I guess one can conclude that DeiT III is currently the SoTA for training plain ViTs from scratch on ImageNet.