Submitted by General-Tart-6934 t3_xwzj0s in singularity
MasterFubar t1_irajhs5 wrote
Reply to comment by DungeonsAndDradis in The End of Programming by General-Tart-6934
An interesting thing about transformers is that they are simpler than the LSTMs that came before them. Problems like vanishing gradients set limits on how complex a neural network can be.
Viewing a single comment thread. View all comments