[R] Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers Submitted by currentscurrents t3_10ly7rw on January 26, 2023 at 6:06 PM in MachineLearning 32 comments 235
[deleted] t1_j61h1lt wrote on January 27, 2023 at 1:06 AM Reply to comment by currentscurrents in [R] Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers by currentscurrents [deleted] Permalink Parent 1
Viewing a single comment thread. View all comments