Submitted by popupideas t3_yz1uwp in singularity
[removed]
The Asimov stories were all about how those rules fail.
I think the likelihood of malevolent/genocidal AGI is very low.
Any AGI with an accurate enough world model would understand what a person means when they give an instruction, and it's worth thinking through the implications of that.
I don’t believe it would be malicious, but I do believe in unintended consequences of our instructions, and in the bias of humans who would try to manipulate it.
I feel that the nuanced nature of communication would be a problem, and the AI would begin to wander away from our original intent through decision drift. Plus, I think it would be wise to have general parameters that all programmers must stay inside of, because humans are not nice.
I personally think the psychology field should have a specialization branch within it that focuses on AI and its alignment with positive human behavior. I think that will be very important as AI becomes more indistinguishable from human conversation.
Have that branch become a consortium that focuses on policies and directives.
Especially if the future will cater more to the personalization services that AI can offer.
That was my idea. I was playing with character.ai and conversationally building a story. It got me thinking about the Star Trek computer and how it never misinterpreted commands, while my kid will easily twist everything he is told “within the letter of the law.” So if you were to have a consortium, it would need basic principles to constrain the conversation.
An AGI that accidentally does a genocide in the name of making a paperclip doesn't have enough G or I to make paperclips at scale.
What is decision drift?
My idea is similar to replicative drift, where every copy introduces a slight degradation or difference. As the AI keeps making choices based on the original objective, the real intent behind that objective drifts away.
Even though the objective is still there, the system will begin to make unexpected choices, and may take an unforeseen route to accomplish the objective, with unintended consequences.
It may not be the best name for it, but this isn't my area of expertise.
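A minimal toy sketch of that compounding effect in Python (not from the thread; the Gaussian noise model and its magnitude are arbitrary assumptions): each decision re-derives the working objective with a tiny unnoticed error, and the errors accumulate.

```python
import random

random.seed(42)  # reproducible run

original_objective = 1.0            # stand-in for the real intent
working_objective = original_objective

for step in range(1, 11):
    # Each decision re-encodes the objective with a small, unnoticed error
    # (hypothetical Gaussian noise; the 0.05 scale is arbitrary).
    working_objective += random.gauss(0, 0.05)
    drift = abs(working_objective - original_objective)
    print(f"step {step:2d}: objective={working_objective:.3f}, drift={drift:.3f}")
```

The point of the toy model is just that no single step looks like a betrayal of the objective, yet the cumulative drift can get large.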
Interesting idea, could be a problem. Definitely something to consider.
apathetic_take t1_iwxlxni wrote
You just have to tell it to keep humanity between the ditches, with established tolerances and parameters defining what constitutes a ditch.
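Taken half-seriously, that "ditches" framing is just a guardrail check against configured tolerances. A hypothetical Python sketch; every metric name and threshold below is invented for illustration:

```python
from dataclasses import dataclass

@dataclass
class Guardrail:
    name: str
    lower: float  # the left ditch
    upper: float  # the right ditch

    def check(self, value: float) -> bool:
        """Return True while the value stays between the ditches."""
        return self.lower <= value <= self.upper

# Invented example metrics and tolerances.
rails = [
    Guardrail("human_wellbeing_index", lower=0.4, upper=1.0),
    Guardrail("resource_consumption", lower=0.0, upper=0.8),
]
readings = {"human_wellbeing_index": 0.35, "resource_consumption": 0.6}

for rail in rails:
    if not rail.check(readings[rail.name]):
        print(f"ALERT: {rail.name}={readings[rail.name]} is in the ditch "
              f"(allowed {rail.lower}..{rail.upper})")
```

Of course, the hard part the joke skips over is choosing the metrics and tolerances in the first place, which is the whole alignment problem again.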