Submitted by faker10101891 t3_10cxuo2 in MachineLearning
currentscurrents t1_j4jj1l6 wrote
Reply to comment by junetwentyfirst2020 in [D] What kinds of interesting models can I train with just an RTX 4080? by faker10101891
It's a little discouraging when every interesting paper has a cluster of 64 A100s in their methods section.
junetwentyfirst2020 t1_j4jkejb wrote
The first image transformer is pretty clear that it works better at scale. You might not need a transformer for interesting work though.
You can do so much with that GPU. I think transformers are heavier models, but my background is on CNNs and those work fine on your GPU.
Viewing a single comment thread. View all comments