_Arsenie_Boca_ t1_j4rxdt8 wrote
Reply to comment by bo_peng in [P] RWKV 14B Language Model & ChatRWKV : pure RNN (attention-free), scalable and parallelizable like Transformers by bo_peng
Is there some more detailed description? Would be interesting to read about these lots of new ideas :)
currentscurrents t1_j4s2n9t wrote
It looks like he goes into a lot more detail on his github.
Viewing a single comment thread. View all comments