Comments
aero_oliver2 OP t1_iymivor wrote
Interesting. So you’re saying that rather than adjusting the models to work on current devices, the better option is actually designing the devices to work with these models?
Dankmemexplorer t1_iymjsty wrote
running the full gpt-3 on a laptop would be like running crysis 3 on a commodore 64. you can't pare it down enough to run without ruining it
Deep-Station-1746 t1_iymmi79 wrote
> we will have to wait a decade or two
The best I can do is 4 years. Take it or leave it.
Dankmemexplorer t1_iymsbgo wrote
my current gpu is 4 years old 😖
state of the art has gotten a lot better since then but not that much better
StChris3000 t1_iyn5rdm wrote
There are advances such as quantization that have enabled edge devices to run some pretty spicy models, so I wouldn't be surprised if we got it down to within gaming computers' reach pretty soon. Also, DeepMind's Chinchilla research showed that GPT-3 was undertrained for its parameter count, so a model with far fewer parameters trained on more data should perform just as well as GPT-3.
(I am only a machine learning enthusiast and not an expert so take everything I say with a grain of salt)
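To make the quantization point above concrete, here is a minimal sketch using PyTorch's dynamic quantization on a toy two-layer model. The model and sizes are stand-ins invented for illustration, not anything from this thread, but the mechanism is the same one used to shrink larger networks for edge deployment: Linear weights get stored as int8 instead of fp32, roughly a 4x reduction in weight memory.

```python
# Minimal sketch of post-training dynamic quantization in PyTorch.
# The toy model below is a stand-in; the same mechanism shrinks
# much larger networks for inference on smaller devices.
import os
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(1024, 4096), nn.ReLU(), nn.Linear(4096, 1024))

# Convert the Linear layers' fp32 weights to int8 after training.
quantized = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

def size_mb(m: nn.Module) -> float:
    """Serialized size of a module's weights, in MB."""
    torch.save(m.state_dict(), "/tmp/_m.pt")
    return os.path.getsize("/tmp/_m.pt") / 1e6

print(f"fp32: {size_mb(model):.1f} MB -> int8: {size_mb(quantized):.1f} MB")
```

Dynamic quantization only converts the stored weights (activations are quantized on the fly at inference time), which is why it is a popular first step for inference-only deployments.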
Dankmemexplorer t1_iymieav wrote
for a sense of scale, GPT-NeoX, a 20-billion-parameter model, requires ~45GB of VRAM to run. gpt-3 davinci is 175 billion parameters.
unless these models can be pared down somehow (unlikely; the whole point of training these huge models is that their performance scales with size), we will have to wait a decade or two for consumer electronics to catch up
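For anyone who wants to check those numbers, here is a back-of-the-envelope sketch (my own arithmetic, not from the thread) of the memory needed just to hold the weights at various precisions. It ignores activations and runtime overhead, which is why GPT-NeoX-20B's real footprint (~45GB) is higher than its ~37 GiB of fp16 weights alone.

```python
# Back-of-the-envelope: GiB needed just to hold the weights.
# Ignores activations, KV cache, and runtime overhead, which is
# why real footprints (e.g. ~45GB for GPT-NeoX-20B) run higher.
def weight_gib(params_billions: float, bytes_per_param: float) -> float:
    return params_billions * 1e9 * bytes_per_param / 1024**3

for name, params in [("GPT-NeoX-20B", 20), ("GPT-3 davinci", 175)]:
    for dtype, nbytes in [("fp16", 2), ("int8", 1), ("int4", 0.5)]:
        print(f"{name:14s} {dtype}: {weight_gib(params, nbytes):6.1f} GiB")
```

Even at int4, davinci-class weights come to roughly 80 GiB, well beyond any current consumer GPU, which backs up the "wait for hardware" conclusion.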