Nextil t1_jb1sg1c wrote
Reply to comment by Art10001 in [R] RWKV (100% RNN) can genuinely model ctx4k+ documents in Pile, and RWKV model+inference+generation in 150 lines of Python by bo_peng
I think they mean with offloading/streaming you need 3GB minimum, but it's much slower.