[R] RWKV (100% RNN) can genuinely model ctx4k+ documents in Pile, and RWKV model+inference+generation in 150 lines of Python Submitted by bo_peng t3_11iwt1b on March 5, 2023 at 1:11 PM in MachineLearning 26 comments 63
Art10001 t1_jb176r8 wrote on March 5, 2023 at 5:32 PM Reply to comment by ThirdMover in [R] RWKV (100% RNN) can genuinely model ctx4k+ documents in Pile, and RWKV model+inference+generation in 150 lines of Python by bo_peng Indeed. Permalink Parent 1
Viewing a single comment thread. View all comments