Viewing a single comment thread. View all comments

currentscurrents t1_j3epeo7 wrote

Transformers are just deep learning with attention.

And attention is just another neural network telling the first one where to look.

4

junetwentyfirst2020 t1_j3ftm38 wrote

That makes sense. Convolutional neural networks were just deep learning with convolution.

1