Viewing a single comment thread. View all comments

rshah4 t1_je0crbz wrote on March 28, 2023 at 2:38 PM

Reply to comment by matus_pikuliak in [P] ChatGPT Survey: Performance on NLP datasets by matus_pikuliak

I agree, these baselines are useful. I think we should push for is more human baselines for these benchmarks. That would help figure out how far we have left to go.