Preserving Deep Representations In One-Shot Pruning: A Hessian-Free Second-Order Optimization Framework

Lucas, Ryan; Mazumder, Rahul

arXiv:2411.18376 (cs)

[Submitted on 27 Nov 2024]

Abstract: We present SNOWS, a one-shot post-training pruning framework aimed at reducing the cost of vision network inference without retraining. Current leading one-shot pruning methods minimize layer-wise least squares reconstruction error, which does not take into account deeper network representations. We propose to optimize a more global reconstruction objective. This objective accounts for nonlinear activations deep in the network to obtain a better proxy for the network loss. This nonlinear objective leads to a more challenging optimization problem, which we demonstrate can be solved efficiently using a specialized second-order optimization framework. A key innovation of our framework is the use of Hessian-free optimization to compute exact Newton descent steps without needing to compute or store the full Hessian matrix. A distinct advantage of SNOWS is that it can be readily applied on top of any sparse mask derived from prior methods, readjusting their weights to exploit nonlinearities in deep feature representations. SNOWS obtains state-of-the-art results on various one-shot pruning benchmarks including residual networks and Vision Transformers (ViT/B-16 and ViT/L-16, with 86M and 304M parameters, respectively).
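
The Hessian-free idea in the abstract can be illustrated on a toy problem. The sketch below is not the authors' implementation and does not use the SNOWS objective, which spans several layers; it assumes a single linear layer followed by a ReLU, a fixed magnitude-based mask, and the reconstruction loss 0.5 * ||relu(Xw) - y||^2. All names (`loss`, `grad`, `hvp`, `conjugate_gradient`, `refit`) are hypothetical. For this piecewise-quadratic loss the Hessian equals X^T D X almost everywhere, where D marks the active ReLU units, so a Hessian-vector product needs only two matrix-vector products and the Hessian itself is never formed. The Newton system is then solved with conjugate gradient on the kept weights only.

```python
import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def loss(w, X, y):
    # Reconstruction error after the nonlinearity.
    r = relu(X @ w) - y
    return 0.5 * float(r @ r)

def grad(w, X, y, mask):
    # Gradient restricted to the kept (unpruned) weights.
    z = X @ w
    return mask * (X.T @ ((relu(z) - y) * (z > 0)))

def hvp(w, v, X, mask, damping=1e-6):
    # Hessian-vector product without forming the Hessian:
    # H v = X^T D X v, with D = diag(1[Xw > 0]) (valid almost everywhere).
    # A small damping term (an assumption of this sketch) keeps the system positive definite.
    v = mask * v
    active = (X @ w > 0)
    return mask * (X.T @ (active * (X @ v))) + damping * v

def conjugate_gradient(matvec, b, iters=50, tol=1e-10):
    # Solves matvec(x) = b using only matrix-vector products.
    x = np.zeros_like(b)
    r = b.copy()
    p = r.copy()
    rs = float(r @ r)
    for _ in range(iters):
        if rs < tol:
            break
        Ap = matvec(p)
        alpha = rs / float(p @ Ap)
        x += alpha * p
        r -= alpha * Ap
        rs_new = float(r @ r)
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x

def refit(w, X, y, mask, steps=5):
    # Newton steps on the kept weights, with backtracking so the loss never increases.
    for _ in range(steps):
        g = grad(w, X, y, mask)
        d = conjugate_gradient(lambda v: hvp(w, v, X, mask), -g)
        t = 1.0
        while t > 1e-4 and loss(w + t * d, X, y) > loss(w, X, y):
            t *= 0.5
        if loss(w + t * d, X, y) <= loss(w, X, y):
            w = w + t * d
    return w

# Toy data: the dense layer's own outputs are the reconstruction target.
rng = np.random.default_rng(0)
X = rng.normal(size=(64, 8))
w_dense = rng.normal(size=8)
y = relu(X @ w_dense)

# Magnitude mask keeping the 4 largest weights (a stand-in for any prior pruning method).
mask = np.zeros(8)
mask[np.argsort(np.abs(w_dense))[-4:]] = 1.0

w_pruned = mask * w_dense
w_refit = refit(w_pruned, X, y, mask)
print(loss(w_pruned, X, y), loss(w_refit, X, y))
```

The mask is treated as given and only the surviving weights are readjusted, which mirrors the abstract's point that this kind of refitting can sit on top of a mask produced by another method. The memory cost is that of a few vectors of the same size as the weights, rather than a matrix quadratic in that size.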

Comments:	10 pages excl. appendix
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2411.18376 [cs.LG]
	(or arXiv:2411.18376v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.18376

Submission history

From: Ryan Lucas
[v1] Wed, 27 Nov 2024 14:25:00 UTC (5,499 KB)
