Submitted by Secure-Technology-78 t3_10mdhxb in MachineLearning
anony_sci_guy t1_j6mr4k6 wrote
Reply to comment by starfries in [R] SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot by Secure-Technology-78
Glad it helped! The first thing I tried was just to re-initialize just like at the beginning of training, but I don't remember how much I dug into trying to modify it before moving on. That's great your seeing some improvements though! Would love to hear how the rest of your experiment goes!! =)
Viewing a single comment thread. View all comments