Submitted by Dr_Singularity t3_ywdsks in singularity
Comments
ihateshadylandlords t1_iwjh7dr wrote
So what are the implications of this? From what I could tell, the article says it trains LLMs faster.
94746382926 t1_iwk1qrv wrote
Linear speed up in training time, not necessarily in performance. Just wanted to mention that as it's an important distinction.
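To make that distinction concrete, here's a tiny sketch (my own illustration, not from the article): near-linear scaling means adding chips cuts wall-clock training time roughly proportionally, while the model you end up with is the same quality, you just reach it sooner.

```python
def training_hours(hours_on_one_chip: float, n_chips: int, efficiency: float = 1.0) -> float:
    """Estimate wall-clock training time under near-linear scaling.

    efficiency ~ 1.0 models the 'near perfect linear scaling' claim;
    real clusters lose some efficiency to communication overhead.
    """
    return hours_on_one_chip / (n_chips * efficiency)

# Doubling chips roughly halves time; final model quality is unchanged.
print(training_hours(1000, 1))   # 1000.0 hours
print(training_hours(1000, 2))   # 500.0 hours
print(training_hours(1000, 10))  # 100.0 hours
```

So "linear speedup" here is a statement about time-to-train, not about loss or benchmark scores.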
visarga t1_iwkbncq wrote
One Cerebras chip is about as fast as 100 top GPUs, but its memory only handles 20B weights; they mention GPT-NeoX 20B. They'd need to stack about 10 of these to train GPT-3.
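Rough back-of-the-envelope math on those numbers (the figures are this commenter's claims, not vendor specs, and GPT-3's 175B parameter count is the one detail I'm adding):

```python
import math

GPU_EQUIV_PER_CHIP = 100      # claim: one Cerebras chip ~ 100 top GPUs in speed
WEIGHTS_PER_CHIP = 20e9       # claim: ~20B parameters fit per chip (GPT-NeoX 20B)
GPT3_PARAMS = 175e9           # GPT-3's published parameter count

# Minimum chips needed just to hold GPT-3's weights.
chips_needed = math.ceil(GPT3_PARAMS / WEIGHTS_PER_CHIP)
print(chips_needed)  # 9 (the commenter rounds up to ~10)

# If scaling stays near-linear, the cluster's GPU-equivalent speed:
print(chips_needed * GPU_EQUIV_PER_CHIP)  # 900
```

So the "stack 10 of these" figure follows directly from 175B / 20B per chip, rounded up with a little headroom.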
AsuhoChinami t1_iwma8zs wrote
The beginning of what?
agorathird t1_iwmfs3j wrote
*points toward subreddit name*
Rakshear t1_iwmo55s wrote
If it can be scaled for mass production, I think we won't need AI to run on cloud-based infrastructure; faster also means cheaper.
lovesdogsguy t1_iwnlavk wrote
Where am I??
AsuhoChinami t1_iws4nvm wrote
Omg lol XD XD Yeah, I wasn't sure if he meant the Singularity or AGI.
Lorraine527 t1_iwxm2oj wrote
GPUs have much more memory per core, and that's needed for language models.
Dr_Singularity OP t1_iwj0ct4 wrote
It Delivers Near Perfect Linear Scaling for Large Language Models