Model Compression for DNN-Based Text-Independent Speaker Verification Using Weight Quantization

Li, Jingyu; Liu, Wei; Zhang, Zhaoyang; Wang, Jiong; Lee, Tan

arXiv:2210.17326v4 (eess)

[Submitted on 31 Oct 2022 (v1), revised 28 Feb 2023 (this version, v4), latest version 25 Sep 2023 (v5)]

Abstract: DNN-based models achieve strong performance on the speaker verification (SV) task, but at substantial computational cost. Model compression can reduce model size and thereby lower resource consumption. This study applies weight quantization to compress two widely used SV models, ECAPA-TDNN and ResNet. Experiments on VoxCeleb indicate that quantization is effective for compressing SV models: the model size can be reduced severalfold with no noticeable performance decline. ResNet yields more robust results than ECAPA-TDNN under lower-bitwidth quantization. An analysis of layer weights suggests that the smooth weight distribution of ResNet may contribute to this robustness. Additional experiments on CN-Celeb validate the quantized models' generalization ability in a language-mismatch scenario. Furthermore, information probing results show that the quantized models preserve most of the speaker-relevant knowledge learned by the original models.
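To make the size-versus-precision trade-off concrete, here is a minimal sketch of weight quantization in plain Python. The abstract does not say which quantization scheme the authors use, so this assumes a generic symmetric uniform quantizer with one scale per weight list. The function names (`quantize_uniform`, `dequantize`, `compression_ratio`) and the 32-bit baseline are illustrative assumptions, not taken from the paper.

```python
import random


def quantize_uniform(weights, bits):
    """Map floats to signed integer codes on a symmetric uniform grid.

    Assumption: one scale per weight list (per-tensor style); the paper's
    actual scheme may differ.
    """
    if bits < 2:
        raise ValueError("bits must be >= 2 for a signed symmetric grid")
    qmax = 2 ** (bits - 1) - 1  # largest code, e.g. 127 for 8 bits
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest grid point and clamp to the representable range.
    codes = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate float weights from integer codes."""
    return [c * scale for c in codes]


def compression_ratio(bits, full_precision_bits=32):
    """Storage reduction for the weights alone, ignoring the stored scale."""
    return full_precision_bits / bits


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic stand-in for one layer's weights (not real model data).
    weights = [rng.gauss(0.0, 0.05) for _ in range(1000)]
    for bits in (8, 4):
        codes, scale = quantize_uniform(weights, bits)
        restored = dequantize(codes, scale)
        max_err = max(abs(w - r) for w, r in zip(weights, restored))
        print(f"{bits}-bit: ratio {compression_ratio(bits):.0f}x, "
              f"max abs error {max_err:.6f}")
```

In this scheme each weight is off by at most half a quantization step, and the step grows as the bitwidth shrinks. A single outlier weight stretches the grid for all the others, which is one plausible reading of why a smooth weight distribution could tolerate lower bitwidths better.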

Comments:	Correct some results
Subjects:	Audio and Speech Processing (eess.AS); Sound (cs.SD)
Cite as:	arXiv:2210.17326 [eess.AS]
	(or arXiv:2210.17326v4 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2210.17326

Submission history

From: Jingyu Li
[v1] Mon, 31 Oct 2022 13:46:47 UTC (188 KB)
[v2] Wed, 9 Nov 2022 14:54:50 UTC (188 KB)
[v3] Mon, 14 Nov 2022 14:24:51 UTC (1 KB) (withdrawn)
[v4] Tue, 28 Feb 2023 09:00:42 UTC (631 KB)
[v5] Mon, 25 Sep 2023 14:29:03 UTC (378 KB)
