Submitted by xutw21 t3_ybzh5j in singularity
rePAN6517 t1_itmxzn7 wrote
Reply to comment by sheerun in Large Language Models Can Self-Improve by xutw21
No, that's not really a good analogy here. The model's text outputs are used as the inputs to a round of fine-tuning. The authors of the paper didn't say whether they did this for just one loop or tried many loops, so I assume they did just one.
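To make the loop concrete, here is a rough sketch of what I mean. It is not the paper's code: `self_improve`, `toy_generate`, and `toy_fine_tune` are made-up names, the "model" is just a dict standing in for a real LLM, and the `loops` parameter is my assumption, since the paper doesn't say how many rounds were run.

```python
from typing import Callable, Dict, List, Tuple

Model = Dict[str, str]          # toy stand-in for a real language model
Example = Tuple[str, str]       # (prompt, model-generated answer)


def self_improve(
    model: Model,
    prompts: List[str],
    generate: Callable[[Model, str], str],
    fine_tune: Callable[[Model, List[Example]], Model],
    loops: int = 1,
) -> Model:
    """Feed the model's own outputs back in as fine-tuning data."""
    for _ in range(loops):
        # Step 1: the current model writes an answer for every prompt.
        dataset = [(prompt, generate(model, prompt)) for prompt in prompts]
        # Step 2: those outputs become the inputs to a round of fine-tuning.
        model = fine_tune(model, dataset)
    return model


# Hypothetical toy implementations so the sketch actually runs.
def toy_generate(model: Model, prompt: str) -> str:
    # Return a memorized answer if there is one, else an uppercase guess.
    return model.get(prompt, prompt.upper())


def toy_fine_tune(model: Model, dataset: List[Example]) -> Model:
    # "Fine-tuning" here just memorizes the generated pairs in a new dict.
    updated = dict(model)
    updated.update(dataset)
    return updated


tuned = self_improve({}, ["a", "b"], toy_generate, toy_fine_tune, loops=1)
```

With `loops=1` this is the single-pass reading; setting `loops` higher would be the "many loops" variant, where each round generates from the model produced by the previous round.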