[R] RWKV (100% RNN) can genuinely model ctx4k+ documents in Pile, and RWKV model+inference+generation in 150 lines of Python Submitted by bo_peng t3_11iwt1b on March 5, 2023 at 1:11 PM in MachineLearning 26 comments 63
Viewing a single comment thread. View all comments