Borrowedshorts t1_j9ka0ta wrote
Reply to comment by WithoutReason1729 in What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] by Destiny_Knight
I don't think that's true, but I do believe it was finetuned on the specific dataset to achieve the SOTA result they did.
Viewing a single comment thread. View all comments