Spire_Citron t1_ja0457j wrote
Reply to comment by TheRidgeAndTheLadder in Likelihood of OpenAI moderation flagging a sentence containing negative adjectives about a demographic as 'Hateful'. by grungabunga
The training data is massive and usually not carefully curated because they need so much of it.
starstruckmon t1_ja1102i wrote
He's talking about the human preference data used for RHLF fine-tuning ( which is what makes ChatGPT from GPT3 ). It's not really that massive.
Viewing a single comment thread. View all comments