Self-Supervised Quantization-Aware Knowledge Distillation

Zhao, Kaiqi; Zhao, Ming

arXiv:2403.11106 (cs)

[Submitted on 17 Mar 2024]

Abstract: Quantization-aware training (QAT) and knowledge distillation (KD) are combined to achieve competitive performance in creating low-bit deep learning models. However, existing works applying KD to QAT require tedious hyper-parameter tuning to balance the weights of different loss terms, assume the availability of labeled training data, and require complex, computationally intensive training procedures for good performance. To address these limitations, this paper proposes a novel Self-Supervised Quantization-Aware Knowledge Distillation (SQAKD) framework. SQAKD first unifies the forward and backward dynamics of various quantization functions, making it flexible enough to incorporate various QAT works. It then formulates QAT as a co-optimization problem that simultaneously minimizes the KL-Loss between the full-precision and low-bit models for KD and the discretization error for quantization, without supervision from labels. A comprehensive evaluation shows that SQAKD substantially outperforms the state-of-the-art QAT and KD works for a variety of model architectures. Our code is at: this https URL.
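To make the label-free objective described in the abstract concrete, the following is a minimal sketch and not the authors' implementation. It assumes a toy one-layer linear model, a plain uniform quantizer, and a temperature-softened KL divergence scaled by T^2, a common KD convention that the abstract does not specify. All function names and parameters (`uniform_quantize`, `distillation_loss`, `temperature`, `bits`) are hypothetical.

```python
import math

def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions given as lists."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def uniform_quantize(x, bits, lo=-1.0, hi=1.0):
    """Clip x to [lo, hi] and round it to one of 2**bits evenly spaced levels.
    Assumption: a generic uniform quantizer, not one from the paper."""
    levels = 2 ** bits - 1
    clipped = min(max(x, lo), hi)
    step = (hi - lo) / levels
    return lo + round((clipped - lo) / step) * step

def distillation_loss(teacher_logits, student_logits, temperature=4.0):
    """Label-free KD loss: KL(teacher || student) on softened outputs.
    Assumption: the T^2 scaling is a common convention, not taken from the source."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    return temperature ** 2 * kl_divergence(p, q)

def forward(weights, inputs):
    """Toy linear layer: one logit per weight row."""
    return [sum(w * x for w, x in zip(row, inputs)) for row in weights]

# Full-precision "teacher" weights and a 2-bit quantized "student" copy.
weights = [[0.31, -0.72], [0.05, 0.88], [-0.44, 0.19]]
x = [1.0, -0.5]

teacher_logits = forward(weights, x)
student_weights = [[uniform_quantize(v, bits=2) for v in row] for row in weights]
student_logits = forward(student_weights, x)

# No ground-truth label appears anywhere: the teacher output is the only target.
loss = distillation_loss(teacher_logits, student_logits)
```

In an actual training loop the quantizer would also need a gradient approximation, such as a straight-through estimator, so that the loss can update the low-bit model. That part is omitted here.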

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.11106 [cs.LG]
	(or arXiv:2403.11106v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.11106

Submission history

From: Kaiqi Zhao
[v1] Sun, 17 Mar 2024 06:20:28 UTC (2,632 KB)
