Viewing a single comment thread. View all comments

Hyper1on t1_j43crwx wrote

That's the InstructGPT paper, which is right for ChatGPT, but Copilot is based on Codex, which does not use RLHF.

3