[R] RWKV-4 7B release: an attention-free RNN language model matching GPT-J performance (14B training in progress) Submitted by bo_peng t3_yxt8sa on November 17, 2022 at 3:32 PM in MachineLearning 22 comments 172
CKtalon t1_iwqk0b9 wrote on November 17, 2022 at 4:37 PM Reply to comment by ChuckSeven in [R] RWKV-4 7B release: an attention-free RNN language model matching GPT-J performance (14B training in progress) by bo_peng It’s written in the 2nd column (params) Permalink Parent 4
Viewing a single comment thread. View all comments