Akimbo333 t1_j77w0vm wrote on February 4, 2023 at 7:23 PM

Oh ok. What exactly is InstructGPT?

Nmanga90 t1_j77zkpf wrote on February 4, 2023 at 7:48 PM

InstructGPT is GPT-3 fine-tuned to follow instructions. It is now the flagship GPT-3 model, and the newest davinci model is an InstructGPT model. ChatGPT is based on InstructGPT and further fine-tuned for dialogue.

Akimbo333 t1_j77zzkk wrote on February 4, 2023 at 7:51 PM

But I don't understand. How does following directions make it better?

Nmanga90 t1_j784rkz wrote on February 4, 2023 at 8:24 PM

What exactly don’t you understand?

Following instructions makes it better because these models are by nature predictive. They don’t understand what you are saying, and are created to predict the next text after the input. By nature, the models basically have an implicit prompt that says “what follows this input:”. This is much less useful than following instructions, because in the real world, there is less money/productivity to be gained by predicting the next text sequence, and more to be gained by completing tasks that you ask it to.
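To make the "what follows this input" idea concrete, here is a toy sketch. It is nothing like GPT-3 internally: it is just a word-level bigram counter, and the corpus, `train_bigram`, and `continue_text` are all made up for illustration. The point is only that a purely predictive model treats an instruction as text to be continued, not as a task to carry out.

```python
from collections import Counter, defaultdict


def train_bigram(corpus: str) -> dict:
    """Count, for every word, which words follow it in the corpus."""
    counts = defaultdict(Counter)
    words = corpus.split()
    for prev, nxt in zip(words, words[1:]):
        counts[prev][nxt] += 1
    return counts


def continue_text(model: dict, prompt: str, max_new_words: int = 4) -> str:
    """Greedily append the most frequent next word, one word at a time."""
    words = prompt.split()
    if not words:
        return prompt
    for _ in range(max_new_words):
        followers = model.get(words[-1])
        if not followers:
            break  # the model has never seen anything after this word
        words.append(followers.most_common(1)[0][0])
    return " ".join(words)


# Hypothetical training text: people *asking* for summaries.
corpus = (
    "please summarize this article for me . "
    "please summarize this article for me . "
    "please summarize this report for me ."
)
model = train_bigram(corpus)

# The model does not summarize anything; it just finishes the request.
print(continue_text(model, "please summarize this"))
# prints: please summarize this article for me .
```

An instruction-tuned model is trained further so that the most likely continuation of a request is the completed task rather than more request-like text.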

Akimbo333 t1_j7853be wrote on February 4, 2023 at 8:26 PM

Oh ok. I see now, thanks for explaining. Maybe they'll make a solveGPT that can actually solve things someday lol!

Nmanga90 t1_j785o7a wrote on February 4, 2023 at 8:31 PM

Haha, AWS actually just released one of these 2 days ago that’s waaaaay smaller but outperforms GPT-3.5 on a science reasoning benchmark (ScienceQA).

Here is the link: https://arxiv.org/abs/2302.00923

Akimbo333 t1_j786lmu wrote on February 4, 2023 at 8:37 PM

Wow, that's so cool! To get proto-AGI, we definitely need an all-in-one multimodal LLM.

Nmanga90 t1_j78du1g wrote on February 4, 2023 at 9:29 PM

Just out of curiosity, what is your education on the subject? I find it kind of strange, or I guess inconsistent, that you’re talking about multimodal LLMs and their necessity, but don’t know about OPT, InstructGPT, or why an Instruct model would be better than a predictive model.

Akimbo333 t1_j78i2ia wrote on February 4, 2023 at 10:00 PM

I have a limited programming background, and I was out of date with GPT models. For a time I thought that it would be better to have a predictive model that can plan ahead. At least that was my mindset.
