Submitted by Liberty2012 t3_11ee7dt in singularity
Surur t1_jaem8nr wrote
Reply to comment by Liberty2012 in Is the intelligence paradox resolvable? by Liberty2012
It is interesting to me that
a) its possible to teach a LLM to be honest when we catch it in a lie.
b) if we ever get to the point where we can not detect a lie (eg. novel information) the AI is incentivised to lie every time.
Viewing a single comment thread. View all comments