Submitted by New_Yak1645 t3_11nhl03 in MachineLearning

Most AI models are impossible to train yourself (like ChatGPT).

Can LLaMA be trained?

Although the dataset is very hard to get, it would be nice if LLaMA could be trained.

I couldn't find this topic when searching Reddit, so I hope this becomes a discussion about hardware requirements or availability.
Thank you.

0

Comments


CKtalon t1_jbnccl7 wrote

If you have a few thousand A100s, sure? The dataset is fairly easily obtainable.

The next difficulty is the technical knowhow to train such LLMs.

16

ch9ki7 t1_jbneot3 wrote

I would start searching on huggingface.co

2

UnusualClimberBear t1_jbngux4 wrote

Training from scratch required 2048 A100s for 21 days, and that seems to be only the final run.

I guess you can start to fine-tune it with much lower resources; 16 A100s seems reasonable, as going lower will require quantization or partial loading of the model.
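For a rough sense of why a GPU count in that ballpark is plausible for full fine-tuning (and why going lower pushes you toward quantization or parameter-efficient methods), here is a back-of-envelope sketch. The 16-bytes-of-state-per-parameter rule of thumb (fp16 weights and gradients plus fp32 master weights and Adam moments) and the 80 GB A100 variant are assumptions for illustration, and activations are ignored entirely:

```python
import math

# Back-of-envelope GPU memory estimate for full fine-tuning with Adam in
# mixed precision. Assumed rule of thumb: ~16 bytes of state per parameter
# (2 fp16 weights + 2 fp16 grads + 4 fp32 master weights + 8 fp32 Adam
# moments), ignoring activations and framework overhead.

BYTES_PER_PARAM = 16
A100_MEM_GIB = 80  # assuming the 80 GB A100 variant

def min_gpus(params_billions: float) -> int:
    """Minimum A100s needed just to hold model + optimizer state."""
    total_gib = params_billions * 1e9 * BYTES_PER_PARAM / 2**30
    return math.ceil(total_gib / A100_MEM_GIB)

for size in (7, 13, 33, 65):
    print(f"LLaMA-{size}B: >= {min_gpus(size)} x A100-80GB of state for full fine-tuning")
```

By this estimate, LLaMA-65B already needs around 13 A100-80GB cards just for weights and optimizer state, before activations, which is consistent with ~16 being a reasonable working number. Quantizing weights to 8-bit or freezing most of the model (e.g. adapter-style fine-tuning) cuts the requirement dramatically.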

7

Raise_Fickle t1_jc1tb4r wrote

Any idea how to fine-tune LLaMA on a multi-GPU setup?

1