currentscurrents t1_j4s2n9t wrote
Reply to comment by _Arsenie_Boca_ in [P] RWKV 14B Language Model & ChatRWKV : pure RNN (attention-free), scalable and parallelizable like Transformers by bo_peng
It looks like he goes into a lot more detail on his GitHub.