WithoutReason1729 t1_j9jmd05 wrote
Reply to comment by Neurogence in What. The. ***k. [less than 1B parameter model outperforms GPT 3.5 in science multiple choice questions] by Destiny_Knight
The catch is that it only outperforms large models in a narrow domain of study. It's not a general-purpose tool like the really large models. That's still impressive though.
Ken_Sanne t1_j9jxg68 wrote
Can it be fine-tuned?
WithoutReason1729 t1_j9jxy78 wrote
You can fine-tune it on another dataset and probably get good results, but you need a nice, high-quality dataset to work with.
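Something like this is the general shape of it (just a rough sketch using Hugging Face Transformers; the model name, file paths, and column names here are placeholders, not whatever setup the paper actually used):

```python
# Rough sketch of fine-tuning a small (<1B parameter) seq2seq model on your own
# Q&A-style dataset. Model choice, file names, and hyperparameters are
# illustrative assumptions, not the paper's actual configuration.
from transformers import (
    AutoTokenizer,
    AutoModelForSeq2SeqLM,
    Seq2SeqTrainer,
    Seq2SeqTrainingArguments,
    DataCollatorForSeq2Seq,
)
from datasets import load_dataset

model_name = "google/flan-t5-base"  # hypothetical small model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSeq2SeqLM.from_pretrained(model_name)

# Placeholder JSON files with "question" and "answer" fields.
dataset = load_dataset(
    "json", data_files={"train": "train.json", "validation": "val.json"}
)

def preprocess(batch):
    # Tokenize questions as inputs and answers as labels, truncating long sequences.
    model_inputs = tokenizer(batch["question"], max_length=512, truncation=True)
    labels = tokenizer(text_target=batch["answer"], max_length=64, truncation=True)
    model_inputs["labels"] = labels["input_ids"]
    return model_inputs

tokenized = dataset.map(
    preprocess, batched=True, remove_columns=dataset["train"].column_names
)

args = Seq2SeqTrainingArguments(
    output_dir="finetuned-model",
    per_device_train_batch_size=8,
    learning_rate=3e-4,
    num_train_epochs=3,
    evaluation_strategy="epoch",
)

trainer = Seq2SeqTrainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    eval_dataset=tokenized["validation"],
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
)
trainer.train()
```

The data quality matters way more than the training loop itself, which is basically boilerplate.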
Ago0330 t1_j9lm5ty wrote
I’m working on one that’s trained on JFK speeches and Bachelorette data to help people with conversation skills.
Gynophile t1_j9msb3s wrote
I can't tell if this is a joke or real
Ago0330 t1_j9msg1r wrote
It’s real. Gonna launch after GME moons
ihopeshelovedme t1_j9npl0j wrote
Sounds like a viable AI implementation to me. I'll be your angel investor and throw some Doge your way or something.
Borrowedshorts t1_j9ka0ta wrote
I don't think that's true, but I do believe it was fine-tuned on the specific dataset to achieve the SOTA result they did.