Taenk t1_jbzaeau wrote
Reply to comment by kkg_scorpio in [P] Discord Chatbot for LLaMA 4-bit quantized that runs 13b in <9 GiB VRAM by Amazing_Painter_7692
Isn't 1-bit quantisation qualitatively different as you can do optimizations only available if the parameters are fully binary?
AsIAm t1_jc168cw wrote
It is. But that doesn't mean 1-bit neural nets are impossible. Even Turing himself toyed with such networks – https://www.npl.co.uk/getattachment/about-us/History/Famous-faces/Alan-Turing/80916595-Intelligent-Machinery.pdf?lang=en-GB
Viewing a single comment thread. View all comments