LoRE-Merging: Exploring Low-Rank Estimation For Large Language Model Merging

Liu, Zehua; Wu, Han; Yao, Yuxuan; She, Ruifeng; Han, Xiongwei; Zhong, Tao; Yuan, Mingxuan

Computer Science > Computation and Language

arXiv:2502.10749 (cs)

[Submitted on 15 Feb 2025]

Title:LoRE-Merging: Exploring Low-Rank Estimation For Large Language Model Merging

Authors:Zehua Liu, Han Wu, Yuxuan Yao, Ruifeng She, Xiongwei Han, Tao Zhong, Mingxuan Yuan

View PDF HTML (experimental)

Abstract:While most current approaches rely on further training techniques, such as fine-tuning or reinforcement learning, to enhance model capacities, model merging stands out for its ability of improving models without requiring any additional training. In this paper, we propose a unified framework for model merging based on low-rank estimation of task vectors without the need for access to the base model, named \textsc{LoRE-Merging}. Our approach is motivated by the observation that task vectors from fine-tuned models frequently exhibit a limited number of dominant singular values, making low-rank estimations less prone to interference. We implement the method by formulating the merging problem as an optimization problem. Extensive empirical experiments demonstrate the effectiveness of our framework in mitigating interference and preserving task-specific information, thereby advancing the state-of-the-art performance in model merging techniques.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.10749 [cs.CL]
	(or arXiv:2502.10749v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.10749

Submission history

From: Han Wu [view email]
[v1] Sat, 15 Feb 2025 10:18:46 UTC (3,386 KB)

Computer Science > Computation and Language

Title:LoRE-Merging: Exploring Low-Rank Estimation For Large Language Model Merging

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:LoRE-Merging: Exploring Low-Rank Estimation For Large Language Model Merging

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators