Submitted by mettle t3_10oyllu in MachineLearning
Blutorangensaft t1_j6mdw93 wrote
Reply to comment by andreichiffa in [Discussion] ChatGPT and language understanding benchmarks by mettle
Is the critic used for fine-tuning or as a part of the loss function during training?
andreichiffa t1_j6mojfv wrote
Most likely as a post-processor, along the lines of guided generation; pretty much the GeDi proposed by Salesforce in 2020.
Viewing a single comment thread. View all comments