[R] Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers Submitted by currentscurrents t3_10ly7rw on January 26, 2023 at 6:06 PM in MachineLearning 32 comments 235
Acceptable-Cress-374 t1_j62ql8t wrote on January 27, 2023 at 8:00 AM Reply to comment by lookatmetype in [R] Why Can GPT Learn In-Context? Language Models Secretly Perform Gradient Descent as Meta-Optimizers by currentscurrents Stable diffusion with proper hands? :) Permalink Parent 9
Viewing a single comment thread. View all comments