[P] RWKV 14B Language Model & ChatRWKV: pure RNN (attention-free), scalable and parallelizable like Transformers
Submitted by bo_peng on January 17, 2023 at 4:54 PM in MachineLearning
chip_0 wrote on January 24, 2023 at 5:11 AM:
Have you used RL with Human Feedback to fine-tune it yet? I have an idea about how to use RLHF without expensive human annotation. Let me know if you would like to collaborate on that!