farmingvillein t1_j12s4mn wrote
Reply to comment by Dankmemexplorer in [R] Nonparametric Masked Language Modeling - MetaAi 2022 - NPM - 500x fewer parameters than GPT-3 while outperforming it on zero-shot tasks by Singularian2501
unfortunately still really slow (for now) to run, however:
> the speed of NPM is still on par with the speed of significantly larger parametric models that NPM outperforms
Dankmemexplorer t1_j13k11f wrote
aint that just the way
yaosio t1_j15h0xa wrote
They also say there's room for improvement but they didn't explore that in this paper. Just think, one day we'll have the power of the sun GPT-3 in the palm of our hand. Could be really soon, could be far away, but it's coming.
ItsTheUltimateBob t1_j16z2v4 wrote
Hopefully, they'll be beyond GPT-3.
red75prime t1_j1899a0 wrote
GPT-3: Sure, I can tell you power output of the sun. It would be 3.8 x 1026 W or 3.234 kW. I'm glad to help.
[deleted] t1_j15g25f wrote
[deleted]
Viewing a single comment thread. View all comments