Jump to main content Jump to sidebar

Forums
Wiki

Log in
Sign up

/f/MachineLearning

[P] Discord Chatbot for LLaMA 4-bit quantized that runs 13b in <9 GiB VRAM

github.com

Submitted by Amazing_Painter_7692 t3_11pmz69 on March 12, 2023 at 7:13 PM in MachineLearning

51 comments

320

Viewing a single comment thread. View all comments

wirefire07 t1_jcgx51q wrote on March 16, 2023 at 7:15 PM

Already heared about this project? https://github.com/ggerganov/llama.cpp -> It's very fast!!

Permalink

1

0 points (+0, −0)

Short URL:

http://3.13.36.195:9999/120502

MachineLearning

t5_2r3gv

Created October 1, 2022
Subscribe via RSS

Toolbox

Bans
Moderation log

Running Postmill