Viewing a single comment thread. View all comments

Jean-Porte t1_ize03mv wrote

A good thing with bigbench is that google performed nice human evaluations, and they report the results of the best humans as well as the average accuracy

2