Viewing a single comment thread. View all comments

starstruckmon t1_j3c8cop wrote

>It automates the final phase of RLHF (reinforcement learning from human feedback) by generating its own training examples from a bunch of rules, an "AI Constitution" so to speak.

I wish they'd just say this instead of all that "constitutional" , "three rules" nonsense.

Makes sense. Should be a lot more easier than RHLF through reward function. That's well known to be finicky as hell.

4