InstructGPT is GPT-3 fine tuned to follow instructions, and is now the flagship GPT3, and the newest davinci model is instructGPT. ChatGPT is based on instructGPT and further fine tuned for dialog.

Akimbo333 t1_j77zzkk wrote on February 4, 2023 at 7:51 PM

#1,723,782

Replying to Nmanga90 (#1,723,762)

But I don't Understand. How does following directions make it better?

Nmanga90 t1_j784rkz wrote on February 4, 2023 at 8:24 PM

#1,723,994

Replying to Akimbo333 (#1,723,782)

What exactly don’t you understand?

Following instructions makes it better because these models are by nature predictive. They don’t understand what you are saying, and are created to predict the next text after the input. By nature, the models basically have an implicit prompt that says “what follows this input:”. This is much less useful than following instructions, because in the real world, there is less money/productivity to be gained by predicting the next text sequence, and more to be gained by completing tasks that you ask it to.

Akimbo333 t1_j7853be wrote on February 4, 2023 at 8:26 PM

#1,724,007

Replying to Nmanga90 (#1,723,994)

Oh ok. I see now thanks for explaining. Maybe they'll make a solveGPT that can actually solve things someday lol!

Nmanga90 t1_j785o7a wrote on February 4, 2023 at 8:31 PM

#1,724,031

Replying to Akimbo333 (#1,724,007)

Haha, AWS actually just released one of these 2 days ago that’s waaaaay smaller but actually outperforms GPT-3 on reasoning tasks.

Here is the link: https://arxiv.org/abs/2302.00923

Akimbo333 t1_j786lmu wrote on February 4, 2023 at 8:37 PM

#1,724,059

Replying to Nmanga90 (#1,724,031)

Wow that's so cool! To get Proto AGI, we definitely need an all in one multimodal LLM

Nmanga90 t1_j78du1g wrote on February 4, 2023 at 9:29 PM

#1,724,394

Replying to Akimbo333 (#1,724,059)

Just out of curiosity, what is your education on the subject? I find it kind of strange or I guess inconsistent that you’re talking about multimodal LLMs and their necessity, but don’t know about OPT, InstructGPT, or why an Instruct model would be better than a predictive model

Akimbo333 t1_j78i2ia wrote on February 4, 2023 at 10:00 PM

#1,724,606

Replying to Nmanga90 (#1,724,394)

I have a limited programming back ground. But I was out of date with GPT models. But for a time I thought that it would be better to have a predictive model that can plan ahead. Atleast that was my mindset.

Infinite police

Comments

Iffykindofguy t1_j73yd0e wrote on February 3, 2023 at 10:08 PM

srasmus97 t1_j73yuiw wrote on February 3, 2023 at 10:11 PM

Catablepas t1_j74dfbr wrote on February 3, 2023 at 11:53 PM

Akimbo333 t1_j74ow1z wrote on February 4, 2023 at 1:20 AM

Nmanga90 t1_j75hc66 wrote on February 4, 2023 at 5:24 AM

Akimbo333 t1_j75nsbq wrote on February 4, 2023 at 6:40 AM

Nmanga90 t1_j75uk2j wrote on February 4, 2023 at 8:11 AM

Akimbo333 t1_j760xc5 wrote on February 4, 2023 at 9:43 AM

Nmanga90 t1_j77nr39 wrote on February 4, 2023 at 6:26 PM

Akimbo333 t1_j77seu7 wrote on February 4, 2023 at 6:58 PM

Nmanga90 t1_j77vdw4 wrote on February 4, 2023 at 7:18 PM

Akimbo333 t1_j77w0vm wrote on February 4, 2023 at 7:23 PM

Nmanga90 t1_j77zkpf wrote on February 4, 2023 at 7:48 PM