Phoneaccount25732 t1_ivydmgs wrote on November 11, 2022 at 3:11 PM

I want more comments like this.

9182763498761234 t1_ivy1mud wrote on November 11, 2022 at 1:43 PM

Cool, thanks for sharing :-)

robbsc t1_ivypqg0 wrote on November 11, 2022 at 4:34 PM

Thanks for taking the time to type this out

samloveshummus t1_iw1o1jg wrote on November 12, 2022 at 6:39 AM

This has to be one of the most useful comments I've read in nearly ten years on Reddit! You must be a gifted teacher.

[deleted] t1_iw2kgbt wrote on November 12, 2022 at 1:47 PM

[deleted]

zimonitrome t1_iwbmzoq wrote on November 14, 2022 at 1:13 PM

Huber loss let's go.

maybelator t1_iwbpkjo wrote on November 14, 2022 at 1:36 PM

Not if you want true sparsity !

zimonitrome t1_iwbst8p wrote on November 14, 2022 at 2:04 PM

Can you elaborate?

maybelator t1_iwbxutj wrote on November 14, 2022 at 2:43 PM

The Huber loss encourages the regularized variable to be close to 0. However, this loss is also smooth: the amplitude of the gradient decreases as the variable nears its stationary point. In consequence, it will have many coordinates close to 0 but not exactly. Achieving true sparsity requires thresholding which adds a a lot of other complications.

In contrast the amplitude of the gradient of the L1 norm (absolute value in dim 1) remain the same no matter how close it gets to 0. The functional has a kink (the subgradient contains a neighborhood of 0). In consequence, if you used a well-suited optimization algorithm, the variable will have true sparsity, i.e. a lot of exact 0.

zimonitrome t1_iwc14i5 wrote on November 14, 2022 at 3:07 PM

Wow thanks for the explanation, it does make sense.

I had a pre-conception that all optimizers dealing with any linear functions (kinda like L1 norm) still produce values close to 0.

I can see someone disregarding tiny values when using said sparsity (pruning, quantization) but didn't think that it would be exactly 0.

[R] ZerO Initialization: Initializing Neural Networks with only Zeros and Ones

maybelator t1_ivxgacq wrote on November 11, 2022 at 9:35 AM