MaxQ: Multi-Axis Query for N:M Sparsity Network

Xiang, Jingyang; Li, Siqi; Chen, Junhao; Chen, Zhuangzhi; Huang, Tianxin; Peng, Linpeng; Liu, Yong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2312.07061 (cs)

[Submitted on 12 Dec 2023 (v1), last revised 17 Mar 2024 (this version, v2)]

Title:MaxQ: Multi-Axis Query for N:M Sparsity Network

Authors:Jingyang Xiang, Siqi Li, Junhao Chen, Zhuangzhi Chen, Tianxin Huang, Linpeng Peng, Yong Liu

View PDF HTML (experimental)

Abstract:N:M sparsity has received increasing attention due to its remarkable performance and latency trade-off compared with structured and unstructured sparsity. However, existing N:M sparsity methods do not differentiate the relative importance of weights among blocks and leave important weights underappreciated. Besides, they directly apply N:M sparsity to the whole network, which will cause severe information loss. Thus, they are still sub-optimal. In this paper, we propose an efficient and effective Multi-Axis Query methodology, dubbed as MaxQ, to rectify these problems. During the training, MaxQ employs a dynamic approach to generate soft N:M masks, considering the weight importance across multiple axes. This method enhances the weights with more importance and ensures more effective updates. Meanwhile, a sparsity strategy that gradually increases the percentage of N:M weight blocks is applied, which allows the network to heal from the pruning-induced damage progressively. During the runtime, the N:M soft masks can be precomputed as constants and folded into weights without causing any distortion to the sparse pattern and incurring additional computational overhead. Comprehensive experiments demonstrate that MaxQ achieves consistent improvements across diverse CNN architectures in various computer vision tasks, including image classification, object detection and instance segmentation. For ResNet50 with 1:16 sparse pattern, MaxQ can achieve 74.6\% top-1 accuracy on ImageNet and improve by over 2.8\% over the state-of-the-art. Codes and checkpoints are available at \url{this https URL}.

Comments:	Accepted by the IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024 (CVPR2024)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2312.07061 [cs.CV]
	(or arXiv:2312.07061v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.07061

Submission history

From: Jingyang Xiang [view email]
[v1] Tue, 12 Dec 2023 08:28:29 UTC (1,412 KB)
[v2] Sun, 17 Mar 2024 03:17:47 UTC (1,542 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:MaxQ: Multi-Axis Query for N:M Sparsity Network

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MaxQ: Multi-Axis Query for N:M Sparsity Network

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators