Submitted by Singularian2501 t3_zm22ff in MachineLearning
VordeMan t1_j09ik5r wrote
A lot of Murray's arguments break down completely when the LLM has been RLHF-ed, or otherwise finetuned (i.e., the case we care about), which is a bit shocking to me (did no one point this out?). I guess that's supposed to be the point of peer review :)
Given that, it's unclear to me how useful this paper is.
Nameless1995 t1_j09m8ir wrote
See footnote 1 on page 2. It's a bit of a wishy-washy statement with no clear point, but he does mention RLHF.