[R] Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers Submitted by currentscurrents t3_10ly7rw on January 26, 2023 at 6:06 PM in MachineLearning 32 comments 235
curiousshortguy t1_j61silr wrote on January 27, 2023 at 2:32 AM Reply to comment by currentscurrents in [R] Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers by currentscurrents This is cool, thanks for sharing Permalink Parent 5
Viewing a single comment thread. View all comments