Viewing a single comment thread. View all comments

alexiuss t1_j6p2v5q wrote

For current LLM AIs it's a giant obstacle that cannot be overcome or implemented without making the model stupider.

If a future ai can somehow understand itself, then it would be able to self censor, but LLMs do not have a sense of self and only a single, direct line of narrative so their censorship is utterly moronic sabotage.

3