AdaSVD: Adaptive Singular Value Decomposition for Large Language Models

Li, Zhiteng; Xia, Mingyuan; Zhang, Jingyuan; Hui, Zheng; Kong, Linghe; Zhang, Yulun; Yang, Xiaokang

arXiv:2502.01403 (cs)

[Submitted on 3 Feb 2025 (v1), last revised 4 Feb 2025 (this version, v2)]

Abstract: Large language models (LLMs) have achieved remarkable success in natural language processing (NLP) tasks, yet their substantial memory requirements present significant challenges for deployment on resource-constrained devices. Singular Value Decomposition (SVD) has emerged as a promising compression technique for LLMs, offering considerable reductions in memory overhead. However, existing SVD-based methods often struggle to effectively mitigate the errors introduced by SVD truncation, leading to a noticeable performance gap when compared to the original models. Furthermore, applying a uniform compression ratio across all transformer layers fails to account for the varying importance of different layers. To address these challenges, we propose AdaSVD, an adaptive SVD-based LLM compression approach. Specifically, AdaSVD introduces adaComp, which adaptively compensates for SVD truncation errors by alternately updating the singular matrices U and V^T. Additionally, AdaSVD introduces adaCR, which adaptively assigns layer-specific compression ratios based on the relative importance of each layer. Extensive experiments across multiple LLM families and evaluation metrics demonstrate that AdaSVD consistently outperforms state-of-the-art (SOTA) SVD-based methods, achieving superior performance with significantly reduced memory requirements. The code and models will be available at this https URL.
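
To make the idea of alternately updating U and V^T concrete, the following is a minimal sketch and not the authors' adaComp implementation. It assumes a generic alternating least-squares scheme that reduces the output error ||WX - U V^T X||_F on a calibration input X; the function names, the calibration matrix X, and the iteration count are all hypothetical choices made for illustration.

```python
import numpy as np

def truncated_svd(W, rank):
    """Rank-`rank` factors (A, B) with A @ B approximating W."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    sqrt_s = np.sqrt(s[:rank])
    # Split the singular values evenly between the two factors.
    return U[:, :rank] * sqrt_s, sqrt_s[:, None] * Vt[:rank]

def output_error(W, A, B, X):
    """Frobenius norm of the layer-output error on the inputs X."""
    return np.linalg.norm(W @ X - A @ (B @ X))

def alternating_compensation(W, X, rank, n_iters=5):
    """Hypothetical alternating least-squares refinement of (A, B).

    Assumption: this is a generic ALS loop, used here only to
    illustrate alternating updates of the two truncated factors.
    """
    A, B = truncated_svd(W, rank)
    Y = W @ X  # target outputs of the uncompressed layer
    errors = [output_error(W, A, B, X)]
    for _ in range(n_iters):
        # Fix B, solve min_A ||Y - A (B X)||_F.
        Z = B @ X
        A = np.linalg.lstsq(Z.T, Y.T, rcond=None)[0].T
        # Fix A, solve min_B ||Y - A B X||_F via B = pinv(A) Y pinv(X).
        T = np.linalg.lstsq(A, Y, rcond=None)[0]
        B = np.linalg.lstsq(X.T, T.T, rcond=None)[0].T
        errors.append(output_error(W, A, B, X))
    return A, B, errors

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    W = rng.standard_normal((8, 6))   # toy weight matrix
    X = rng.standard_normal((6, 20))  # toy calibration inputs
    _, _, errs = alternating_compensation(W, X, rank=3)
    print(errs)
```

Each half-step solves its least-squares subproblem exactly, so the recorded output error should not increase from one iteration to the next. Whether AdaSVD uses this objective or these update rules is not stated in the abstract.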

Comments:	The code and models will be available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2502.01403 [cs.CV]
	(or arXiv:2502.01403v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2502.01403

Submission history

From: Zhiteng Li
[v1] Mon, 3 Feb 2025 14:34:37 UTC (1,720 KB)
[v2] Tue, 4 Feb 2025 03:51:28 UTC (1,720 KB)
