MorallyDeplorable t1_jc1umt7 wrote
Reply to comment by Necessary_Ad_9800 in [P] Discord Chatbot for LLaMA 4-bit quantized that runs 13b in <9 GiB VRAM by Amazing_Painter_7692
I'm not actually sure. I've just been chatting with people in an unrelated Discord's off topic channel about it.
I'd post some of what I've got from it but I have no idea what I'm doing with it and don't think what I'm getting would be decently representative of what it can actually do.
Viewing a single comment thread. View all comments