Semantics Prompting Data-Free Quantization for Low-Bit Vision Transformers

Zhong, Yunshan; Zhou, Yuyao; Zhang, Yuxin; Li, Shen; Li, Yong; Chao, Fei; Zeng, Zhanpeng; Ji, Rongrong

Computer Science > Computer Vision and Pattern Recognition

arXiv:2412.16553 (cs)

[Submitted on 21 Dec 2024 (v1), last revised 30 Dec 2024 (this version, v2)]

Abstract: Data-free quantization (DFQ), which enables model quantization without real data and thereby addresses growing concerns about data security, has garnered significant attention in the model compression community. Recently, the unique architecture of vision transformers (ViTs) has driven the development of specialized DFQ techniques. However, we observe that the synthetic images produced by existing methods are semantically deficient compared to real images, which compromises performance. Motivated by this, we propose SPDFQ, a Semantics Prompting Data-Free Quantization method for ViTs. First, SPDFQ incorporates Attention Priors Alignment (APA), which uses randomly generated attention priors to enhance the semantics of synthetic images. Second, SPDFQ introduces Multi-Semantic Reinforcement (MSR), which utilizes localized patch optimization to prompt efficient parameterization and diverse semantics in synthetic images. Finally, SPDFQ employs Softlabel Learning (SL), where soft learning targets are adapted to encourage more complex semantics and accommodate images augmented by MSR. Experimental results demonstrate that SPDFQ significantly outperforms existing methods. For instance, SPDFQ achieves a 15.52% increase in top-1 accuracy on ImageNet for W4A4 ViT-B.
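
For readers unfamiliar with the notation, W4A4 denotes 4-bit weights and 4-bit activations. The following is a minimal sketch of generic uniform quantization at that bit width, assuming a simple min/max asymmetric scheme. It is not the quantizer used by SPDFQ, which the abstract does not specify, and the function name `fake_quantize` is hypothetical.

```python
from typing import List


def fake_quantize(values: List[float], bits: int = 4) -> List[float]:
    """Quantize values to 2**bits uniform levels, then map back to floats.

    Illustrative assumption: asymmetric min/max range, round-to-nearest.
    """
    levels = 2 ** bits - 1              # 15 steps between 16 integer codes for 4 bits
    lo, hi = min(values), max(values)
    if hi == lo:                        # constant input: nothing to quantize
        return list(values)
    scale = (hi - lo) / levels          # width of one quantization step
    out = []
    for v in values:
        q = round((v - lo) / scale)     # integer code
        q = max(0, min(levels, q))      # clamp to the valid code range
        out.append(lo + q * scale)      # dequantize back to a float
    return out


if __name__ == "__main__":
    weights = [-0.8, -0.1, 0.0, 0.3, 0.75]
    print(fake_quantize(weights, bits=4))
```

With 4 bits, every output value falls on one of at most 16 grid points, which is why low-bit settings such as W4A4 are sensitive to the quality of the calibration data, whether real or synthetic.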

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2412.16553 [cs.CV]
	(or arXiv:2412.16553v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.16553

Submission history

From: Yunshan Zhong
[v1] Sat, 21 Dec 2024 09:30:45 UTC (1,126 KB)
[v2] Mon, 30 Dec 2024 01:00:49 UTC (1,126 KB)
