liyanjia92 OP t1_jdjwfnh wrote
Reply to comment by Puzzleheaded_Acadia1 in [P] ChatGPT with GPT-2: A minimum example of aligning language models with RLHF similar to ChatGPT by liyanjia92
It maybe better to submit an issue on github so that i can point you to some code with context. if you are talking my code, you need to convert the weights and load it into GPT class before running SFT training. otherwise there might be mismatch in weights and it could just output random stuff.
Viewing a single comment thread. View all comments