Viewing a single comment thread. View all comments

Spire_Citron t1_ja0457j wrote

The training data is massive and usually not carefully curated because they need so much of it.

4

starstruckmon t1_ja1102i wrote

He's talking about the human preference data used for RHLF fine-tuning ( which is what makes ChatGPT from GPT3 ). It's not really that massive.

1