visarga t1_jdloh24 wrote on March 25, 2023 at 9:12 AM

Since RLHF fine-tuning is short compared to pretraining, you can keep training your original (pre-RLHF) model and then run RLHF on it again.
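
To make the workflow concrete, here is a rough sketch of the bookkeeping only, not a real training loop. The `Checkpoint` class, both helper functions, and all step counts are made up for illustration; the point is just that you keep the base checkpoint around, extend it, and redo the comparatively cheap RLHF stage on top.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Checkpoint:
    """Hypothetical stand-in for a saved model; only tracks step counts."""
    pretrain_steps: int
    rlhf_steps: int = 0


def continue_pretraining(base: Checkpoint, extra_steps: int) -> Checkpoint:
    """Extend the base (pre-RLHF) model; refuses an already-tuned checkpoint."""
    if base.rlhf_steps != 0:
        raise ValueError("continue from the pre-RLHF checkpoint, not the tuned one")
    return replace(base, pretrain_steps=base.pretrain_steps + extra_steps)


def apply_rlhf(base: Checkpoint, steps: int) -> Checkpoint:
    """Return a tuned copy; the base checkpoint itself is left untouched."""
    return replace(base, rlhf_steps=steps)


# Illustrative numbers only (assumed, not measured).
base_v1 = Checkpoint(pretrain_steps=100_000)
chat_v1 = apply_rlhf(base_v1, steps=1_000)

# Later: more pretraining on the original model, then RLHF again.
base_v2 = continue_pretraining(base_v1, extra_steps=20_000)
chat_v2 = apply_rlhf(base_v2, steps=1_000)
```

The only design choice here is that `apply_rlhf` returns a copy, so the pre-RLHF checkpoint stays available for further training.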