curiousshortguy t1_j61silr wrote on January 27, 2023 at 2:32 AM

Reply to comment by currentscurrents in [R] Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers by currentscurrents

This is cool, thanks for sharing