xrailgun t1_j9aq903 wrote on February 20, 2023 at 3:34 PM

Reply to comment by wywywywy in [D] Large Language Models feasible to run on 32GB RAM / 8 GB VRAM / 24GB VRAM by head_robotics

Did you test any larger models that wouldn't run?

Also, any comments so far on the ones you tried? Good? Bad? Easy? Etc?

wywywywy t1_j9ar2tk wrote on February 20, 2023 at 3:40 PM

I did test larger ones but they didn't run. I can't remember which, probably GPT-J. I recently got a 3090 so I can load larger models now.

As for quality, my use case is simple and nothing sophisticated (writing prompts to help with stories & articles), and they worked well until ChatGPT came along. I use ChatGPT instead now.

xrailgun t1_j9avboh wrote on February 20, 2023 at 4:09 PM

Thanks!

I wish model publishers would indicate rough (V)RAM requirements...

wywywywy t1_j9b2kqu wrote on February 20, 2023 at 4:57 PM

So, not scientific at all, but I've noticed that checkpoint file size * 0.6 is pretty close to the actual VRAM requirement for an LLM.

But you're right, it'd be nice to have a table handy.
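
For anyone who wants to apply the file size * 0.6 rule of thumb above, here is a minimal Python sketch. The 0.6 factor is just the rough observation from this thread, not a measured constant, and the function names and the example checkpoint size are made up for illustration. Real usage will also vary with precision, context length and framework overhead.

```python
import os

GIB = 1024 ** 3
VRAM_FACTOR = 0.6  # rough rule of thumb from this thread, not a measured constant


def total_checkpoint_bytes(paths):
    """Sum the on-disk size of one or more checkpoint files (e.g. shards)."""
    return sum(os.path.getsize(p) for p in paths)


def estimate_vram_gib(size_bytes, factor=VRAM_FACTOR):
    """Rough VRAM estimate in GiB: checkpoint size in bytes times the factor."""
    return size_bytes * factor / GIB


def fits(size_bytes, vram_gib, factor=VRAM_FACTOR):
    """True if the estimated requirement is within the given VRAM budget."""
    return estimate_vram_gib(size_bytes, factor) <= vram_gib


# Hypothetical 24 GiB checkpoint checked against the cards from the thread title.
example_size = 24 * GIB
print(f"estimated VRAM: {estimate_vram_gib(example_size):.1f} GiB")
for card_gib in (8, 24):
    print(f"{card_gib} GiB card: fits = {fits(example_size, card_gib)}")
```

With real files you would pass `total_checkpoint_bytes([...])` for your own checkpoint paths instead of the made-up size. Treat the result as a first guess and leave some headroom.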