Puzzleheaded_Acadia1 t1_jdlw72w wrote on March 25, 2023 at 11:02 AM

Can someone explain to me what this paper is about

yaosio t1_jdomvtr wrote on March 25, 2023 at 11:52 PM

I think they give GPT-4 a task, GPT-4 attempts to complete it and is told if it worked or not, then GPT-4 looks at what happened and determines why it failed, and then tries again with this new knowledge. This is all done through natural language prompts, the model isn't being changed.

I saw somebody else in either this sub or /r/openai using a very similar method to get GPT-4 to write and deploy a webpage that could accept valid email addresses. Of course, I can't find it, and neither can Bing Chat, so maybe I dreamed it. I distinctly remember asking if it could do QA, and then the person asked what I meant, and I said have it check for bugs. I post a lot so I can't find it in my post history.

I remember the way it worked was they gave it the task, then GPT-4 would write out what it was going to do, what it predicted would happen, write the code, and then check if what it did worked. If it didn't work it would write out why it didn't work, plan again, then act again. So it went plan->predict->act->check->plan. This successfully worked as it went from nothing to a working and deployed webpage without any human intervention other than setting the task.

pm_me_your_pay_slips t1_jdv748e wrote on March 27, 2023 at 1:15 PM

this is literally what gdb did during the GPT-4 launch livestream

sneakpeekbot t1_jdomx0y wrote on March 25, 2023 at 11:52 PM

Here's a sneak peek of /r/OpenAI using the top posts of the year!

#1: meme | 110 comments
#2: ChatGPT transforming data and running SQL queries | 119 comments
#3: [Official] ChatGPT now supports plugins!!! | 270 comments

[deleted] t1_jdrjkx4 wrote on March 26, 2023 at 5:04 PM

[removed]

kim_en t1_jdmu877 wrote on March 25, 2023 at 4:00 PM

me too