diviramon t1_iydklcc wrote on November 30, 2022 at 4:41 PM

Reply to comment by zaptrem in [R] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models - Massachusetts Institute of Technology and NVIDIA Guangxuan Xiao et al - Enables INT8 for LLM bigger than 100B parameters including OPT-175B, BLOOM-176B and GLM-130B. by Singularian2501

Nope - see my answer below.