Submitted by fintechSGNYC t3_1095os9 in MachineLearning
Hyper1on t1_j43crwx wrote
Reply to comment by --algo in [D] Microsoft ChatGPT investment isn't about Bing but about Cortana by fintechSGNYC
That's the InstructGPT paper, which is right for ChatGPT, but Copilot is based on Codex, which does not use RLHF.
--algo t1_j43rpre wrote
Are you sure? This implies otherwise: https://openai.com/blog/instruction-following/
But maybe it's only for the non-codex models
Hyper1on t1_j43wyf3 wrote
You can see the full details here: https://beta.openai.com/docs/model-index-for-researchers
Copilot itself is the 12B Codex model, with further refinements.
Viewing a single comment thread. View all comments