Submitted by notyourregularnerd t3_101qbfl in MachineLearning
jokokokok t1_j2q5htz wrote
Reply to comment by 99posse in [D] life advice to relatively late bloomer ML theory researcher. by notyourregularnerd
>One interesting, recent observation is that as you scale models up, the specifics of the architecture no longer matter much and pretty much anything reasonable will work well enough
Could you share some more information on this? Is it from a paper? I would like to read more.