Submitted by Balance- t3_124eyso in MachineLearning
StellaAthena t1_je3tz04 wrote
Reply to comment by regalalgorithm in [N] OpenAI may have benchmarked GPT-4’s coding ability on it’s own training data by Balance-
I found this analysis incredibly unconvincing. They used a weaker standard for deduplication than is standard as well as a weaker analysis than the one they did for the GPT-3 paper.
Viewing a single comment thread. View all comments