Submitted by Neurosymbolic t3_1027qvv in MachineLearning

There were many great papers this past year in the field, but below (and in the video) are five papers that may have been overlooked. (YT video: https://www.youtube.com/watch?v=XnUf9twdchI)

Papers are linked as follows

What interesting papers do you think were overlooked this past year?

3

Comments

You must log in or register to comment.

gamerx88 t1_j2vzjfx wrote

"An empirical analysis of compute-optimal large language model training" by Deepmind, suggesting that LLMs are over-parameterized or under-trained (insufficient data used in training).

2