IluvBsissa t1_j9j9ml9 wrote on February 22, 2023 at 11:01 AM

Reply to comment by astonzhang in [R] Multimodal Chain-of-Thought Reasoning in Language Models - Amazon Web Services Zhuosheng Zhang et al - Outperforms GPT-3.5 by 16% (75%->91%) and surpasses human performance on ScienceQA while having less than 1B params! by Singularian2501

Dr. Zhang, thank you so much. Please can you tell us more about your model's performance ? How would it do on standard MMLU ? Can it be improved by increasing parameters count ? The paper didn't mention if the human testers were average human or experts ?

astonzhang t1_j9sd3mw wrote on February 24, 2023 at 5:17 AM

The human performance was taken from the paper from Lu et al.