Viewing a single comment thread. View all comments

Fallen-stars123 t1_j9yufow wrote

It seems that the new "idea" will be to train a lot more tokens, than just increasing the number of parameters, it seems that we were undertraining the models.

I imagine that GPT-4 will see a big jump in the amount of tokens trained.

2