[R] RWKV-4 7B release: an attention-free RNN language model matching GPT-J performance (14B training in progress) Submitted by bo_peng t3_yxt8sa on November 17, 2022 at 3:32 PM in MachineLearning 22 comments 172
Sylv__ t1_iww823x wrote on November 18, 2022 at 8:39 PM Plot twist: the model getting integrated in transformers lib ( ͡° ͜ʖ ͡°) Permalink 3 bo_peng OP t1_iwwapqr wrote on November 18, 2022 at 8:57 PM What we have at this moment: https://github.com/huggingface/transformers/issues/17230 Permalink Parent 5
bo_peng OP t1_iwwapqr wrote on November 18, 2022 at 8:57 PM What we have at this moment: https://github.com/huggingface/transformers/issues/17230 Permalink Parent 5
Viewing a single comment thread. View all comments