cipri_tom
cipri_tom t1_jcjr74y wrote
Reply to comment by gliptic in [R] RWKV 14B ctx8192 is a zero-shot instruction-follower without finetuning, 23 token/s on 3090 after latest optimization (16G VRAM is enough, and you can stream layers to save more VRAM) by bo_peng
first vey is not vey ! :)
cipri_tom t1_jcjeehj wrote
Reply to [R] RWKV 14B ctx8192 is a zero-shot instruction-follower without finetuning, 23 token/s on 3090 after latest optimization (16G VRAM is enough, and you can stream layers to save more VRAM) by bo_peng
This is great! It just needs a name that's as great as the work
RWKV is a tongue twister. How about Ruckus?
cipri_tom t1_j8qppo1 wrote
Babka? Where is this from?
We have a super similar thing in Romania, we call it cozonac
Anyhow, looks yum
cipri_tom t1_iy5fd5w wrote
Reply to Prima Luce, me, 3D, 2022 by losbadhombres
Newb question: if it's 3D, does it mean you can turn the camera to another angle?
It's interesting that you chose this angle, not a cm more to the right or left.
cipri_tom t1_jcjr8rn wrote
Reply to comment by cipri_tom in [R] RWKV 14B ctx8192 is a zero-shot instruction-follower without finetuning, 23 token/s on 3090 after latest optimization (16G VRAM is enough, and you can stream layers to save more VRAM) by bo_peng
Man, ChatRNN
The stars would be pouring in on the repo if you named it ChatRNN. People love an antagonist, and they love "going back to the old days" and proving that the old way was better.