Submitted by [deleted] t3_11tmu9u in MachineLearning
Single_Blueberry t1_jcjvh6o wrote
Reply to comment by Available_Lion_652 in [D] GPT-4 is really dumb by [deleted]
Again, can't find a reliable source for that.
I personally doubt that GPT-4 is significantly larger than GPT 3.x, simply because that would also further inflate inference cost, which you generally want to avoid in a product (as opposed to a research feat).
Better architecture, better RLHF, more and better train data, more train compute? Seems all reasonable.
Orders of magnitudes larger again? Don't think so.
Viewing a single comment thread. View all comments