Amazing_Painter_7692 OP t1_jbz7hta wrote
Reply to comment by 3deal in [P] Discord Chatbot for LLaMA 4-bit quantized that runs 13b in <9 GiB VRAM by Amazing_Painter_7692
It's the weights from Meta/Facebook Research, in the format used by the HuggingFace transformers library.