ReasonablyBadass t1_iwoq0ug wrote
Reply to comment by TheRealSerdra in [R] Will we run out of data? An analysis of the limits of scaling datasets in Machine Learning - Epochai Pablo Villalobos et al - Trend of ever-growing ML models might slow down if data efficiency is not drastically improved! by Singularian2501
Not a complete one. GPT-3, I think, didn't complete its first pass-through
zzzthelastuser t1_iwpi7r5 wrote
You could argue GPT-3 was trained on a subset of the available training data, no?
Not completing the first pass-through means the remaining data could be considered not part of the training data.
ReasonablyBadass t1_iwplk0c wrote
Semantics. It didn't see any of its data more than once, and it had more available. Not one full epoch.
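For illustration, a minimal sketch (plain Python, hypothetical numbers; GPT-3's real corpus is measured in tokens and is vastly larger) of what "less than one full epoch" means: the step budget runs out before the loop has visited every example once, so each example is seen at most once and the remainder is never trained on.

```python
# Hypothetical sizes for illustration only.
dataset = list(range(1_000_000))   # stand-in for the full corpus
step_budget = 300_000              # training stops before one epoch

seen = set()
for step, example in enumerate(dataset):
    if step >= step_budget:
        break                      # budget exhausted mid-epoch
    seen.add(example)              # each example seen at most once

print(f"Epoch fraction completed: {len(seen) / len(dataset):.0%}")
# -> 30%: the other 70% was available but never seen by the model
```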
zzzthelastuser t1_iwpltkw wrote
Sure, but in theory my little Hello World network also had more data available on the internet.