HateRedditCantQuitit t1_iqx446n wrote

F(X) = s(WX + b) isn’t all of deep learning.

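You may have heard of transformers, whose attention layers are closer to s(X W X^T) (but actually more involved than that): the input X shows up twice, so the term inside s is quadratic in X rather than linear. They’re an extremely popular model right now.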
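To make the difference concrete, here is a rough sketch in plain Python (no libraries, so the helpers `matmul`, `transpose`, and `softmax_rows` are hand-written for illustration). It assumes s is a row-wise softmax, treats each row of X as one token (so the dense layer is written as s(XW + b) rather than s(WX + b)), and uses made-up toy numbers. It is not real attention, which adds separate query/key/value projections, scaling, and multiple heads on top of this.

```python
import math

def matmul(A, B):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def transpose(A):
    """Swap rows and columns."""
    return [list(col) for col in zip(*A)]

def softmax_rows(A):
    """Row-wise softmax; stands in for the nonlinearity s (an assumption)."""
    out = []
    for row in A:
        m = max(row)  # subtract the max for numerical stability
        exps = [math.exp(v - m) for v in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out

def dense_layer(X, W, b):
    """s(XW + b): each row of X is transformed on its own."""
    Z = matmul(X, W)
    return softmax_rows([[z + bj for z, bj in zip(row, b)] for row in Z])

def attention_scores(X, W):
    """s(X W X^T): an n-by-n matrix relating every row of X to every other row."""
    return softmax_rows(matmul(matmul(X, W), transpose(X)))

# Toy data: 3 tokens with 2 features each (arbitrary values).
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
W = [[0.5, -0.5], [0.25, 1.0]]
b = [0.0, 0.1]

layer_out = dense_layer(X, W, b)   # 3 x 2: one output row per token
scores = attention_scores(X, W)    # 3 x 3: one weight per pair of tokens
```

The shapes tell the story: the dense layer gives one output per token, while the X W X^T form gives one score per pair of tokens, which is what lets rows of the input interact with each other.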