The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks

Yu, Xin; Serra, Thiago; Ramalingam, Srikumar; Zhe, Shandian

Computer Science > Machine Learning

arXiv:2203.04466 (cs)

[Submitted on 9 Mar 2022 (v1), last revised 19 Jun 2022 (this version, v3)]

Title:The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks

Authors:Xin Yu, Thiago Serra, Srikumar Ramalingam, Shandian Zhe

View PDF

Abstract:Neural networks tend to achieve better accuracy with training if they are larger -- even if the resulting models are overparameterized. Nevertheless, carefully removing such excess parameters before, during, or after training may also produce models with similar or even improved accuracy. In many cases, that can be curiously achieved by heuristics as simple as removing a percentage of the weights with the smallest absolute value -- even though magnitude is not a perfect proxy for weight relevance. With the premise that obtaining significantly better performance from pruning depends on accounting for the combined effect of removing multiple weights, we revisit one of the classic approaches for impact-based pruning: the Optimal Brain Surgeon(OBS). We propose a tractable heuristic for solving the combinatorial extension of OBS, in which we select weights for simultaneous removal, as well as a systematic update of the remaining weights. Our selection method outperforms other methods under high sparsity, and the weight update is advantageous even when combined with the other methods.

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2203.04466 [cs.LG]
	(or arXiv:2203.04466v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2203.04466

Submission history

From: Xin Yu [view email]
[v1] Wed, 9 Mar 2022 00:58:04 UTC (744 KB)
[v2] Sat, 12 Mar 2022 20:57:28 UTC (744 KB)
[v3] Sun, 19 Jun 2022 23:53:13 UTC (1,422 KB)

Computer Science > Machine Learning

Title:The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators