Submitted by IamTimNguyen t3_105v7el in MachineLearning
cdsmith t1_j3ev3je wrote
Reply to comment by ThatInternetGuy in [R] Greg Yang's work on a rigorous mathematical theory for neural networks by IamTimNguyen
This is definitely a theory presentation, though it does end with some applications to hyperparameter transfer when scaling model size. But if your main experience with ML is building models and applications, I'm not surprised it looks unfamiliar.
That said, give it a chance if you're interested. Some parts of the outline didn't look familiar to me either, but the video is well-made and stops to explain most of the background knowledge. And you can always gloss over the bits you don't understand.