Understanding Diversity based Pruning of Neural Networks -- Statistical Mechanical Analysis

Acharyya, Rupam; Zhang, Boyu; Chattoraj, Ankani; Das, Shouman; Stefankovic, Daniel

Computer Science > Machine Learning

arXiv:2006.16617v2 (cs)

[Submitted on 30 Jun 2020 (v1), revised 25 Feb 2021 (this version, v2), latest version 11 Jun 2021 (v3)]

Title:Understanding Diversity based Pruning of Neural Networks -- Statistical Mechanical Analysis

Authors:Rupam Acharyya, Boyu Zhang, Ankani Chattoraj, Shouman Das, Daniel Stefankovic

View PDF

Abstract:Deep learning architectures with a huge number of parameters are often compressed using pruning techniques to ensure computational efficiency of inference during deployment. Despite multitude of empirical advances, there is no theoretical understanding of the effectiveness of different pruning methods. We address this issue by setting up the problem in the statistical mechanics formulation of a teacher-student framework and deriving generalization error (GE) bounds of specific pruning methods. This theoretical premise allows comparison between pruning methods and we use it to investigate compression of neural networks via diversity-based pruning methods. A recent work showed that Determinantal Point Process (DPP) based node pruning method is notably superior to competing approaches when tested on real datasets. Using GE bounds in the aforementioned setup we provide theoretical guarantees for their empirical observations. Another consistent finding in literature is that sparse neural networks (edge pruned) generalize better than dense neural networks (node pruned) for a fixed number of parameters. We use our theoretical setup to prove that baseline random edge pruning method performs better than DPP node pruning method. Finally, we draw motivation from our theoretical results to propose a DPP edge pruning technique for neural networks which empirically outperforms other competing pruning methods on real datasets.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2006.16617 [cs.LG]
	(or arXiv:2006.16617v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2006.16617

Submission history

From: Rupam Acharyya [view email]
[v1] Tue, 30 Jun 2020 09:15:25 UTC (5,355 KB)
[v2] Thu, 25 Feb 2021 03:34:28 UTC (5,355 KB)
[v3] Fri, 11 Jun 2021 04:34:46 UTC (9,538 KB)

Computer Science > Machine Learning

Title:Understanding Diversity based Pruning of Neural Networks -- Statistical Mechanical Analysis

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Understanding Diversity based Pruning of Neural Networks -- Statistical Mechanical Analysis

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators