CLOVER: Cross-Layer Orthogonal Vectors Pruning and Fine-Tuning

Meng, Fanxu; Tang, Pingzhi; jiang, Fan; Zhang, Muhan

Computer Science > Machine Learning

arXiv:2411.17426 (cs)

[Submitted on 26 Nov 2024 (v1), last revised 31 Jan 2025 (this version, v3)]

Title:CLOVER: Cross-Layer Orthogonal Vectors Pruning and Fine-Tuning

Authors:Fanxu Meng, Pingzhi Tang, Fan jiang, Muhan Zhang

View PDF HTML (experimental)

Abstract:Decoder-only models generate tokens autoregressively by caching key/value vectors, but as the cache grows, inference becomes memory-bound. To address this issue, we introduce CLOVER (Cross-Layer Orthogonal Vectors), a novel approach that treats pairs of attention layers as a set of low-rank decompositions. CLOVER applies Singular Value Decomposition (SVD) to the \( Q \)-\( K \) and \( V \)-\( O \) pairs within each attention head. The resulting singular values can either guide pruning or serve as trainable parameters for efficient fine-tuning of all orthogonal vectors. After pruning or fine-tuning, these values are reintegrated into the model without increasing its parameter count. We apply CLOVER to various models, including GPT-2 XL, DeepSeek-V2-Lite, Whisper-Large-v3, Stable Diffusion XL, and LLaMA-3.2-11B-Vision. Our results demonstrate that CLOVER significantly improves pruning efficiency. For instance, the perplexity of pruning 70\% of the \( Q \)-\( K \) pairs in GPT-2 XL is similar to that of pruning just 8\% with vanilla methods. Fine-tuning the singular values further results in a full-rank update, outperforming state-of-the-art methods (LoRA, DoRA, HiRA, and PiSSA) by 7.6\%, 5.5\%, 3.8\%, and 0.7\%, respectively, on eight commonsense tasks for LLaMA-2 7B.

Comments:	this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2411.17426 [cs.LG]
	(or arXiv:2411.17426v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.17426

Submission history

From: Meng Fanxu [view email]
[v1] Tue, 26 Nov 2024 13:34:02 UTC (1,072 KB)
[v2] Sat, 21 Dec 2024 16:34:28 UTC (4,547 KB)
[v3] Fri, 31 Jan 2025 14:13:49 UTC (5,436 KB)

Computer Science > Machine Learning

Title:CLOVER: Cross-Layer Orthogonal Vectors Pruning and Fine-Tuning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:CLOVER: Cross-Layer Orthogonal Vectors Pruning and Fine-Tuning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators