Submitted by crap_punchline t3_10swlk9 in singularity
[removed]
Submitted by crap_punchline t3_10swlk9 in singularity
[removed]
It must be interesting to lie awake at night and think about how maybe one day you won't be able to harass minorities online.
Lol
I think open source llm will be a thing one day. There are already open source 6B and 20B LLMs
Meta has open sourced copies of GPT-3 that are up to 175B
Really? You got links?
Wow! When was this released?
Like 6 months ago or so. They have also announced plans to open source the “instruct” model for OPT-30B and 175B I think in the next 2 months
You mean ChatGPT open sourced?
Not exactly but close. ChatGPT is instructGPT fine tuned for dialog. You could make your own version, but it would be pretty expensive
Oh ok. What exactly is instructGPT?
InstructGPT is GPT-3 fine tuned to follow instructions, and is now the flagship GPT3, and the newest davinci model is instructGPT. ChatGPT is based on instructGPT and further fine tuned for dialog.
But I don't Understand. How does following directions make it better?
What exactly don’t you understand?
Following instructions makes it better because these models are by nature predictive. They don’t understand what you are saying, and are created to predict the next text after the input. By nature, the models basically have an implicit prompt that says “what follows this input:”. This is much less useful than following instructions, because in the real world, there is less money/productivity to be gained by predicting the next text sequence, and more to be gained by completing tasks that you ask it to.
Oh ok. I see now thanks for explaining. Maybe they'll make a solveGPT that can actually solve things someday lol!
Haha, AWS actually just released one of these 2 days ago that’s waaaaay smaller but actually outperforms GPT-3 on reasoning tasks.
Here is the link: https://arxiv.org/abs/2302.00923
Wow that's so cool! To get Proto AGI, we definitely need an all in one multimodal LLM
Just out of curiosity, what is your education on the subject? I find it kind of strange or I guess inconsistent that you’re talking about multimodal LLMs and their necessity, but don’t know about OPT, InstructGPT, or why an Instruct model would be better than a predictive model
I have a limited programming back ground. But I was out of date with GPT models. But for a time I thought that it would be better to have a predictive model that can plan ahead. Atleast that was my mindset.
Iffykindofguy t1_j73yd0e wrote
No one is going to stop you from offending anyone holy shittttttttttttttttttttt get off the cross lol