Submitted by New_Yak1645 t3_11nhl03 in MachineLearning
UnusualClimberBear t1_jbngux4 wrote
Training from scratch required 2048 A100 for 21 days. And it seems only to be the final run.
I guess you can start to fine-tune it with much lower resources, 16 A100 seems reasonable as going lower will require quantization or partial loadings for the model.
potatoandleeks t1_jbnl6se wrote
Wow, they cost $15k a piece. So that's $30 million just for the GPUs! But since you only need them for 21 days, can probably sell them later on craigslist
QTQRQD t1_jbnmcv7 wrote
you really think Meta spent 30 million on GPUs and then sold them on craigslist?
LetMeGuessYourAlts t1_jboc0o3 wrote
You're right they have FB Marketplace why would they use CL?
potatoandleeks t1_jbpu6m1 wrote
Good point
UnusualClimberBear t1_jbnoo8n wrote
You can rent some (but not thousands) on vast.ai around $1.5 an hour
SomewhereAtWork t1_jcf9g5p wrote
Would it be possible to train a quantzised model?
UnusualClimberBear t1_jcfd2jt wrote
Yes, doable on a low budget if you have no fear of legal actions...
SomewhereAtWork t1_jcg3kak wrote
Thank you!
SomewhereAtWork t1_jcx8aoc wrote
> if you have no fear of legal actions...
Legal actions? They can direct that to my Legal-LLaMA. ;-)
Viewing a single comment thread. View all comments