Submitted by New_Yak1645 t3_11nhl03 in MachineLearning

Most AI models (like ChatGPT) are impossible for individuals to train.

Can LLaMA be trained?

Although the dataset is very hard to get, it would be nice if LLaMA could be trained.

Searching Reddit didn't turn up this topic, so I hope this becomes a discussion about hardware and availability.
Thank you.

0

Comments


CKtalon t1_jbnccl7 wrote

If you have a few thousand A100s, sure? The dataset is fairly easily obtainable.

The next difficulty is the technical knowhow to train such LLMs.

16

UnusualClimberBear t1_jbngux4 wrote

Training from scratch required 2048 A100s for 21 days, and that seems to be only the final run.

I guess you can start to fine-tune it with much lower resources; 16 A100s seems reasonable, since going lower will require quantization or partial loading of the model.

7
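The numbers in the comments above can be sanity-checked with some back-of-envelope arithmetic. The 16-bytes-per-parameter figure below is a common rule of thumb for full fine-tuning with Adam in mixed precision, and the model sizes are assumptions for illustration:

```python
# Pretraining compute implied by the comment above: 2048 A100s for 21 days.
gpu_hours = 2048 * 21 * 24
print(f"{gpu_hours:,} A100-hours")  # 1,032,192 A100-hours

# Full fine-tuning memory with Adam in mixed precision, a common rule of
# thumb: ~16 bytes/parameter (fp16 weights 2 + fp16 grads 2 + fp32 master
# weights 4 + fp32 Adam moments 8). Activations are ignored here.
def finetune_memory_gb(n_params: float, bytes_per_param: int = 16) -> float:
    return n_params * bytes_per_param / 1e9

for n in (7e9, 65e9):
    print(f"{n / 1e9:.0f}B params -> ~{finetune_memory_gb(n):,.0f} GB")
# 65B at ~16 bytes/param needs on the order of 1 TB of GPU memory, i.e.
# roughly sixteen 80 GB A100s -- consistent with the estimate above.
```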

potatoandleeks t1_jbnl6se wrote

Wow, they cost $15k a piece. So that's $30 million just for the GPUs! But since you only need them for 21 days, you can probably sell them later on craigslist.

6

SomewhereAtWork t1_jcf9g5p wrote

Would it be possible to train a quantized model?

1
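Training directly in 4- or 8-bit is hard because gradient updates need finer precision than the quantized grid provides; the usual workaround is to freeze the quantized base weights and train only small full-precision adapter matrices (LoRA-style methods). A toy, pure-Python sketch of the absmax quantization step itself, for illustration only:

```python
# Toy sketch of 8-bit absmax quantization: the frozen base weights in
# adapter-style fine-tuning are stored like this, while training happens
# in small full-precision adapter matrices kept separately.

def quantize_absmax(weights):
    """Map floats to int8 codes in [-127, 127], scaled by the absolute max."""
    scale = max(abs(w) for w in weights) / 127.0
    codes = [round(w / scale) for w in weights]
    return codes, scale

def dequantize(codes, scale):
    """Recover approximate floats from int8 codes."""
    return [c * scale for c in codes]

w = [0.4, -1.27, 0.05, 0.9]
codes, scale = quantize_absmax(w)
w_hat = dequantize(codes, scale)
err = max(abs(a - b) for a, b in zip(w, w_hat))
print(codes)  # [40, -127, 5, 90]
# Round-trip error is bounded by half the quantization step.
assert err <= scale / 2 + 1e-12
```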

ch9ki7 t1_jbneot3 wrote

I would start searching on huggingface.co

2

Raise_Fickle t1_jc1tb4r wrote

Any ideas for fine-tuning LLaMA on a multi-GPU setup?

1
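The most common multi-GPU route is data parallelism: each GPU holds a model replica, processes a different micro-batch, and gradients are averaged across replicas (an all-reduce) before every optimizer step. Real setups would use torch.distributed, FSDP, or DeepSpeed; this is just a pure-Python toy of the arithmetic that step performs:

```python
# Toy illustration of data-parallel gradient averaging (the all-reduce step).
# Each simulated "GPU" contributes the gradient from its own micro-batch,
# and all replicas then apply the same averaged update.

def all_reduce_mean(per_gpu_grads):
    """Average gradients element-wise across simulated GPU replicas."""
    n = len(per_gpu_grads)
    return [sum(g[i] for g in per_gpu_grads) / n
            for i in range(len(per_gpu_grads[0]))]

# Gradients from 4 simulated GPUs, each over a different micro-batch.
grads = [
    [0.1, 0.2],
    [0.3, 0.0],
    [0.1, 0.4],
    [0.5, 0.2],
]
avg = all_reduce_mean(grads)
print(avg)  # ~ [0.25, 0.2]
```

For a model too large to replicate per GPU (like LLaMA's bigger variants), the same idea is combined with sharding the weights and optimizer states across devices, which is what FSDP and DeepSpeed ZeRO do.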