36% of HellaSwag benchmark contains errors [D] Submitted by BB4evaTB12 t3_zff5mh on December 7, 2022 at 9:51 PM in MachineLearning 6 comments 33
gwern t1_izezqic wrote on December 8, 2022 at 4:59 PM https://news.ycombinator.com/item?id=33874955 Permalink 2
Viewing a single comment thread. View all comments