izumi3682 OP t1_ixk6ojd wrote on November 24, 2022 at 1:28 AM

#642,221

Submission statement from OP. Note: This submission statement "locks in" after about 30 minutes, and can no longer be edited. Please refer to my statement they link, which I can continue to edit. I often edit my submission statement, sometimes for the next few days if needs must. There is often required additional grammatical editing and additional added detail.

Here is the research paper.

https://www.science.org/doi/10.1126/science.ade9097

From the article.

>To create Cicero, Meta pulled together AI models for strategic reasoning (similar to AlphaGo) and natural language processing (similar to GPT-3) and rolled them into one agent. During each game, Cicero looks at the state of the game board and the conversation history and predicts how other players will act. It crafts a plan that it executes through a language model that can generate human-like dialogue, allowing it to coordinate with other players.

>Meta calls Cicero's natural language skills a "controllable dialogue model," which is where the heart of Cicero's personality lies. Like GPT-3, Cicero pulls from a large corpus of Internet text scraped from the web. "To build a controllable dialogue model, we started with a 2.7 billion parameter BART-like language model pre-trained on text from the Internet and fine tuned on over 40,000 human games on webDiplomacy.net," writes Meta.

>The resulting model mastered the intricacies of a complex game. "Cicero can deduce, for example, that later in the game it will need the support of one particular player," says Meta, "and then craft a strategy to win that person’s favor—and even recognize the risks and opportunities that that player sees from their particular point of view."

So, my question is, is this an "incremental improvement" in our AI development efforts, or is this more like the "AI significantly improves every three months" level of improvement.

https://www.ml-science.com/exponential-growth

Are we seeing any evidence that AI of any form is improving significantly every 3 months?

AcademicGuest t1_ixk6r37 wrote on November 24, 2022 at 1:28 AM

#642,230

Nope, it’s all invalid because it is not performed by a human.

izumi3682 OP t1_ixk7qay wrote on November 24, 2022 at 1:36 AM

#642,347

Replying to AcademicGuest (#642,230)

What is not performed by a human? The game play? I thought the point was that the AI was learning to outplay humans in highly sophisticated incomplete information games. If I am misunderstanding your point, please explain to me what you mean.

AcademicGuest t1_ixk7wo6 wrote on November 24, 2022 at 1:38 AM

#642,367

Replying to izumi3682 (#642,347)

“Outplay a human” you answered your own statement. AI represents, unless strictly curtailed and linear, an inherit violation of human will and freedom

jwg020 t1_ixk84dp wrote on November 24, 2022 at 1:39 AM

#642,393

I mean, I feel like AI would be better at ruling strictly on logical reason as opposed to religious fanaticism, corporate greed, etc. I’m all for it.

Coachtzu t1_ixk8o3i wrote on November 24, 2022 at 1:44 AM

#642,459

Replying to jwg020 (#642,393)

Strict logic doesn't always work in the real world though. Sometimes you need an empathetic voice in the room. We have plenty of faults governing ourselves, but I'm not sure the AI should be trusted to find answers that aren't necessarily for the greater good, but benefit those not in charge.

izumi3682 OP t1_ixkb038 wrote on November 24, 2022 at 2:02 AM

#642,689

Replying to AcademicGuest (#642,367)

>AI represents, unless strictly curtailed and linear, an inherit violation of human will and freedom

I am still not sure what this has to do with the development of ever more powerful AI technology. But ok.

[deleted] t1_ixkbdte wrote on November 24, 2022 at 2:05 AM

#642,718

[removed]

[deleted] t1_ixkbeka wrote on November 24, 2022 at 2:06 AM

#642,721

[removed]

[deleted] t1_ixkbif2 wrote on November 24, 2022 at 2:06 AM

#642,729

[removed]

FuturologyBot t1_ixkdqvd wrote on November 24, 2022 at 2:25 AM

#642,966

The following submission statement was provided by /u/izumi3682:

Submission statement from OP. Note: This submission statement "locks in" after about 30 minutes, and can no longer be edited. Please refer to my statement they link, which I can continue to edit. I often edit my submission statement, sometimes for the next few days if needs must. There is often required additional grammatical editing and additional added detail.

Here is the research paper.

https://www.science.org/doi/10.1126/science.ade9097

From the article.

>To create Cicero, Meta pulled together AI models for strategic reasoning (similar to AlphaGo) and natural language processing (similar to GPT-3) and rolled them into one agent. During each game, Cicero looks at the state of the game board and the conversation history and predicts how other players will act. It crafts a plan that it executes through a language model that can generate human-like dialogue, allowing it to coordinate with other players.

>Meta calls Cicero's natural language skills a "controllable dialogue model," which is where the heart of Cicero's personality lies. Like GPT-3, Cicero pulls from a large corpus of Internet text scraped from the web. "To build a controllable dialogue model, we started with a 2.7 billion parameter BART-like language model pre-trained on text from the Internet and fine tuned on over 40,000 human games on webDiplomacy.net," writes Meta.

>The resulting model mastered the intricacies of a complex game. "Cicero can deduce, for example, that later in the game it will need the support of one particular player," says Meta, "and then craft a strategy to win that person’s favor—and even recognize the risks and opportunities that that player sees from their particular point of view."

So, my question is, is this an "incremental improvement" in our AI development efforts, or is this more like the "AI significantly improves every three months" level of improvement.

https://www.ml-science.com/exponential-growth

Are we seeing any evidence that AI of any form is improving significantly every 3 months?

Please reply to OP's comment here: https://old.reddit.com/r/Futurology/comments/z36el2/training_our_future_rulers_meta_researchers/ixk6ojd/

[deleted] t1_ixkedqc wrote on November 24, 2022 at 2:30 AM

#643,040

[deleted]

mapadofu t1_ixkerit wrote on November 24, 2022 at 2:33 AM

#643,089

Dude, I could never master Diplomacy despite playing many times.

VenatorDomitor t1_ixklvy6 wrote on November 24, 2022 at 3:34 AM

#643,905

Replying to [deleted] (#643,040)

Well considering it’s a classic board game from 1959 I’m not really sure what you’re expecting

VenatorDomitor t1_ixkm0ee wrote on November 24, 2022 at 3:36 AM

#643,921

Replying to izumi3682 (#642,689)

I don’t think they understood the question to be honest

Zacpod t1_ixkmlqu wrote on November 24, 2022 at 3:41 AM

#643,977

I, for one, welcome our AI overlords. They can't possibly be worse than the power hungry sociopaths we vote in.

imhere2downvote t1_ixkrdex wrote on November 24, 2022 at 4:25 AM

#644,532

Replying to Zacpod (#643,977)

until they build too many too fast and turn us into batteries

ATLHawksfan t1_ixkx4lb wrote on November 24, 2022 at 5:22 AM

#645,135

Replying to Coachtzu (#642,459)

Just sterilize the stupid and poor, euthanize the elderly and other drains on society, and nuke all other countries.

USA #1!!!

Coachtzu t1_ixkxba0 wrote on November 24, 2022 at 5:24 AM

#645,156

Replying to ATLHawksfan (#645,135)

You forgot remove child labor laws and exploit 3rd world nations for the sake of "economic process"

ABrokenBinding t1_ixl2j5l wrote on November 24, 2022 at 6:21 AM

#645,652

Somehow a tool that can trick humans with natural language, placed in the hands of DJ Markie Z, doesn't seem like something's scholars would call "good".

Just me?

SailboatAB t1_ixl3p8d wrote on November 24, 2022 at 6:34 AM

#645,783

Replying to ABrokenBinding (#645,652)

Also, they're literally training it to take over the world.

Swordbears t1_ixl5yc0 wrote on November 24, 2022 at 7:01 AM

#646,006

Replying to Zacpod (#643,977)

The AIs will be owned and controlled by the wealthy if we don't fix this shit first. Our AI overlords are most likely going to be better at oppressing and exploiting us for the sake of the few.

leapdayjose t1_ixl6q0o wrote on November 24, 2022 at 7:11 AM

#646,078

Replying to SailboatAB (#645,783)

Kinda reminds me of avenue 5

[deleted] t1_ixlbjmh wrote on November 24, 2022 at 8:12 AM

#646,541

[removed]

rixtil41 t1_ixlgzgk wrote on November 24, 2022 at 9:28 AM

#646,998

Replying to Coachtzu (#642,459)

But I would trust a logical AI than an emotional person on average.

Roqwer t1_ixlhc2n wrote on November 24, 2022 at 9:33 AM

#647,014

Replying to imhere2downvote (#644,532)

As long I can eat steak in the simulation, no problem.

Moonbase0 t1_ixlttj1 wrote on November 24, 2022 at 12:28 PM

#648,152

Is this Night Mother related? Because all I can hear is "Let's kill someone"

Sidoplanka t1_ixluzp5 wrote on November 24, 2022 at 12:41 PM

#648,294

Using "Meta", "future rulers" and "diplomacy" in the same sentence is just beyond silly 😂

t0slink t1_ixlvwji wrote on November 24, 2022 at 12:51 PM

#648,402

Replying to ABrokenBinding (#645,652)

They will probably use it to make realistic NPCs in VR games. TBH we are about to see games that will make the best open-world games today look like Pac-Man.

Nyarlathotep854 t1_ixlxnh3 wrote on November 24, 2022 at 1:09 PM

#648,604

Honestly, as stupid as this narrow application sounds, i am excited for what this means for strategy games

Kaionacho t1_ixlyybc wrote on November 24, 2022 at 1:22 PM

#648,785

As long as the AI doesn't accept bribes that would already be better then what we have now

[deleted] t1_ixm268t wrote on November 24, 2022 at 1:52 PM

#649,198

[removed]

[deleted] t1_ixm4ytm wrote on November 24, 2022 at 2:17 PM

#649,578

[removed]

hungrycryptohippo t1_ixma79t wrote on November 24, 2022 at 3:00 PM

#650,186

So isn’t the title of this post misleading? The AI trained on existing data and used simulators so it’s only good at learning how to play diplomacy specifically and would need more data to be applied to any other domain.

Still an impressive result but geez, it’s not like we have a general system here that can be interfaced to a new problem and do well, especially if there isn’t data like there was for diplomacy from other humans.

DragoonXNucleon t1_ixmbucl wrote on November 24, 2022 at 3:12 PM

#650,386

Replying to hungrycryptohippo (#650,186)

I think you misread. The AI trained on data, but played against real humans in real online leagues. In those leagues it performed well.

Sexycoed1972 t1_ixos28a wrote on November 25, 2022 at 2:48 AM

#660,357

Replying to Zacpod (#643,977)

Really? Imagine if Donald Trump and his army of clowns were all hyper-efficient super-geniuses.

No thank you.

DyingShell t1_ixrsaoz wrote on November 25, 2022 at 8:42 PM

#672,835

Replying to AcademicGuest (#642,367)

AI will replace Homo Sapiens, this is the purpose of our existence.

Training Our Future Rulers - Meta researchers create AI that masters (the board game) 'Diplomacy', tricking human players. Meta's Cicero can negotiate or persuade with natural language—just like a human.

Comments

izumi3682 OP t1_ixk6ojd wrote on November 24, 2022 at 1:28 AM

AcademicGuest t1_ixk6r37 wrote on November 24, 2022 at 1:28 AM

izumi3682 OP t1_ixk7qay wrote on November 24, 2022 at 1:36 AM

AcademicGuest t1_ixk7wo6 wrote on November 24, 2022 at 1:38 AM

jwg020 t1_ixk84dp wrote on November 24, 2022 at 1:39 AM

Coachtzu t1_ixk8o3i wrote on November 24, 2022 at 1:44 AM

izumi3682 OP t1_ixkb038 wrote on November 24, 2022 at 2:02 AM

[deleted] t1_ixkbdte wrote on November 24, 2022 at 2:05 AM

[deleted] t1_ixkbeka wrote on November 24, 2022 at 2:06 AM

[deleted] t1_ixkbif2 wrote on November 24, 2022 at 2:06 AM

FuturologyBot t1_ixkdqvd wrote on November 24, 2022 at 2:25 AM

[deleted] t1_ixkedqc wrote on November 24, 2022 at 2:30 AM

mapadofu t1_ixkerit wrote on November 24, 2022 at 2:33 AM

VenatorDomitor t1_ixklvy6 wrote on November 24, 2022 at 3:34 AM

VenatorDomitor t1_ixkm0ee wrote on November 24, 2022 at 3:36 AM

Zacpod t1_ixkmlqu wrote on November 24, 2022 at 3:41 AM

imhere2downvote t1_ixkrdex wrote on November 24, 2022 at 4:25 AM

ATLHawksfan t1_ixkx4lb wrote on November 24, 2022 at 5:22 AM

Coachtzu t1_ixkxba0 wrote on November 24, 2022 at 5:24 AM

ABrokenBinding t1_ixl2j5l wrote on November 24, 2022 at 6:21 AM

SailboatAB t1_ixl3p8d wrote on November 24, 2022 at 6:34 AM

Swordbears t1_ixl5yc0 wrote on November 24, 2022 at 7:01 AM

leapdayjose t1_ixl6q0o wrote on November 24, 2022 at 7:11 AM

[deleted] t1_ixlbjmh wrote on November 24, 2022 at 8:12 AM

rixtil41 t1_ixlgzgk wrote on November 24, 2022 at 9:28 AM

Roqwer t1_ixlhc2n wrote on November 24, 2022 at 9:33 AM

Moonbase0 t1_ixlttj1 wrote on November 24, 2022 at 12:28 PM

Sidoplanka t1_ixluzp5 wrote on November 24, 2022 at 12:41 PM

t0slink t1_ixlvwji wrote on November 24, 2022 at 12:51 PM

Nyarlathotep854 t1_ixlxnh3 wrote on November 24, 2022 at 1:09 PM

Kaionacho t1_ixlyybc wrote on November 24, 2022 at 1:22 PM

[deleted] t1_ixm268t wrote on November 24, 2022 at 1:52 PM

[deleted] t1_ixm4ytm wrote on November 24, 2022 at 2:17 PM

hungrycryptohippo t1_ixma79t wrote on November 24, 2022 at 3:00 PM

DragoonXNucleon t1_ixmbucl wrote on November 24, 2022 at 3:12 PM

Sexycoed1972 t1_ixos28a wrote on November 25, 2022 at 2:48 AM

DyingShell t1_ixrsaoz wrote on November 25, 2022 at 8:42 PM