[D] Why restrict to using a linear function to represent neurons? Submitted by MLNoober t3_xuogm3 on October 3, 2022 at 4:38 PM in MachineLearning 36 comments 33
HateRedditCantQuitit t1_iqx446n wrote on October 3, 2022 at 7:02 PM F(x) = s(WX+b) isn’t all of deep learning. You may have heard of transformers, which are closer to s(X W X^T) (but actually more involved than that). They’re an extremely popular model right now. Permalink 0
Viewing a single comment thread. View all comments