rePAN6517 t1_itmxzn7 wrote on October 24, 2022 at 8:44 PM

Reply to comment by sheerun in Large Language Models Can Self-Improve by xutw21

No that's not really a good analogy here. The model's text outputs are the inputs to a round of fine tuning. The authors of the paper didn't specify if they did this for just 1 loop or tried many loops, but since they didn't specify I think they mean they just did 1 loop.

sheerun t1_itnbjf5 wrote on October 24, 2022 at 10:16 PM

Child is fine tuning its brain with what adult say and vice versa

rePAN6517 t1_itng5zg wrote on October 24, 2022 at 10:51 PM

No, The model is fined tuned on it's own output. Don't try to anthropomorphize this.