muchcharles
muchcharles t1_j87nheg wrote
Schmidhuber prior art
muchcharles t1_j65b3a6 wrote
Reply to comment by element8 in [R] SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot by Secure-Technology-78
DeepMind put out a paper on adjusting the pruning mask during training (by reviving pruned weights if a transiently stored gradient exceeds some threshold).
The paper is called Rigging the Lottery (referencing the lottery ticket hypothesis about initial weights), and the method is RigL, I think.
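A rough sketch of the mask update as I described it above, not the paper's exact rule: pruned weights are revived when the gradient magnitude at their position exceeds a threshold. The function name `revive_pruned` and the `threshold` parameter are made up for illustration.

```python
def revive_pruned(mask, grads, threshold):
    """Return a new pruning mask (1 = active, 0 = pruned).

    Hypothetical helper: a pruned position is revived when the
    magnitude of its gradient exceeds `threshold`. Active positions
    are left untouched.
    """
    new_mask = list(mask)
    for i, (m, g) in enumerate(zip(mask, grads)):
        if m == 0 and abs(g) > threshold:
            new_mask[i] = 1  # bring this weight back into the network
    return new_mask


# Example: positions 1 and 2 are pruned; only position 1 has a large gradient.
mask = [1, 0, 0, 1]
grads = [0.1, -0.9, 0.05, 2.0]
print(revive_pruned(mask, grads, threshold=0.5))
```

The gradients for pruned positions only need to be kept around for the update step, which is why they can be stored transiently rather than on every iteration.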
muchcharles t1_j6530pm wrote
Reply to comment by wren42 in ⭕ What People Are Missing About Microsoft’s $10B Investment In OpenAI by LesleyFair
Didn't they just release Whisper for everyone, including the trained weights? And Stable Diffusion, I think, used their CLIP model initially.
muchcharles t1_j652n0u wrote
You should probably mention Microsoft's margin on those cloud credits. Their $10b investment didn't cost them $10b. Their return is capped at much more than 10X, seemingly using Azure cloud credits as a way of working around OpenAI's maximum profit corporate bylaws.
And if OpenAI nets enough profit for a future multi-trillion-dollar valuation, what is OpenAI's margin? The non-margin part will partly (mostly?) be cloud bills with Azure, exceeding the $10 billion in credits by a lot. Part of the investment was Azure exclusivity. If they become Azure's #1 customer, MS can set the pricing to whatever they want to claw back more, as long as they drop their other customers (assuming this is an AGI scenario).
Basically, OpenAI's profit cap is bypassed by laundering profit into MS cloud bills, and MS is the de facto owner of OpenAI through that laundering system if OpenAI becomes huge but needs lots of compute.
muchcharles t1_j5mbjtb wrote
Reply to comment by Avelina9X in [D] Did YouTube just add upscaling? by Avelina9X
> Dedicated graphics is completely idle during this.
Are you sure fixed-function decode/upscale hardware is reported in GPU utilization graphs?
muchcharles t1_iv6xcb5 wrote
Reply to comment by now-here-be in TSMC approaching 1 nm with 2D materials breakthrough by maxtility
Increasing linear density by 2X (not necessarily happening depending on how they are applying the marketing term to actual sizes) means quadrupling the number of transistors.
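The arithmetic, assuming transistors are laid out on a 2D plane so that the count per unit area scales with the square of linear density (the symbols $\lambda$ for linear density and $N$ for transistors per unit area are just notation introduced here):

```latex
N \propto \lambda^{2}
\quad\Longrightarrow\quad
\frac{N_{\text{new}}}{N_{\text{old}}}
  = \left(\frac{2\lambda}{\lambda}\right)^{2}
  = 4
```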
muchcharles t1_iujosdj wrote
Reply to comment by Geneocrat in Giant farming robot uses 3D vision and robotic arms to harvest ripe strawberries by Anen-o-me
> The energy required to recognize the fruit
Have you actually worked this out? ML inference is much less energy intensive than training.
muchcharles t1_iujojqr wrote
Reply to comment by Quealdlor in Giant farming robot uses 3D vision and robotic arms to harvest ripe strawberries by Anen-o-me
> there has not been any UBI.
The earned income tax credit (EITC) is pretty close.
muchcharles t1_jee5c78 wrote
Reply to comment by CypherLH in When will AI actually start taking jobs? by Weeb_Geek_7779
More to do with interest rates, copying Musk, and some overhiring during COVID.