Submitted by austintackaberry t3_120usfk in MachineLearning
visarga t1_jdloh24 wrote
Reply to comment by light24bulbs in [R] Hello Dolly: Democratizing the magic of ChatGPT with open models by austintackaberry
Since RLHF finetuning is short, you can continue training your original model and RLHF again.
Viewing a single comment thread. View all comments