Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning

Song, Haobo; Zhao, Hao; Majumder, Soumajit; Lin, Tao

arXiv:2407.01320 (cs)

[Submitted on 1 Jul 2024]

Abstract: Fine-tuning large pre-trained foundation models, such as the 175B-parameter GPT-3, for downstream tasks has recently attracted growing attention. While parameter-efficient fine-tuning methods have been proposed and proven effective without retraining all model parameters, their performance is limited by the capacity of incremental modules, especially under constrained parameter budgets. To overcome this challenge, we propose CapaBoost, a simple yet effective strategy that enhances model capacity by leveraging low-rank updates through parallel weight modules in target layers. By applying static random masks to the shared weight matrix, CapaBoost constructs a diverse set of weight matrices, effectively increasing the rank of incremental weights without adding parameters. Notably, our approach can be seamlessly integrated into various existing parameter-efficient fine-tuning methods. We extensively validate the efficacy of CapaBoost through experiments on diverse downstream tasks, including natural language understanding, question answering, and image classification. Our results demonstrate significant improvements over baselines, without incurring additional computation or storage costs. Our code is available at this https URL.
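The masking idea in the abstract can be pictured with a small sketch. This is not the authors' implementation: it assumes a LoRA-style update built from two shared low-rank factors `B` and `A`, and it assumes each parallel module applies its own fixed binary mask elementwise to both factors before the products are summed. All names (`rand_mask`, `masked_sum_update`, the sizes `d`, `r`, `k`, the keep ratio) are hypothetical and chosen only for illustration. Exact rational arithmetic is used so that the rank computation is not affected by rounding.

```python
import random
from fractions import Fraction


def rand_matrix(rows, cols, rng):
    """Small random integer matrix, stored as exact Fractions."""
    return [[Fraction(rng.randint(-5, 5)) for _ in range(cols)] for _ in range(rows)]


def rand_mask(rows, cols, seed, keep=0.5):
    """Static binary mask: rebuilt from its seed, so it is neither stored nor trained."""
    rng = random.Random(seed)
    return [[1 if rng.random() < keep else 0 for _ in range(cols)] for _ in range(rows)]


def hadamard(m, mask):
    """Elementwise product of a matrix with a 0/1 mask."""
    return [[x * k for x, k in zip(row, mrow)] for row, mrow in zip(m, mask)]


def matmul(a, b):
    cols = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols] for row in a]


def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def rank(m):
    """Exact rank by Gaussian elimination over Fractions."""
    m = [row[:] for row in m]
    rk = 0
    for c in range(len(m[0])):
        pivot = next((i for i in range(rk, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for i in range(rk + 1, len(m)):
            f = m[i][c] / m[rk][c]
            m[i] = [x - f * y for x, y in zip(m[i], m[rk])]
        rk += 1
        if rk == len(m):
            break
    return rk


def masked_sum_update(B, A, seeds):
    """Sum of (B * mask_B) @ (A * mask_A) over parallel modules (assumed form)."""
    d_out, r, d_in = len(B), len(A), len(A[0])
    total = [[Fraction(0)] * d_in for _ in range(d_out)]
    for s in seeds:
        Bm = hadamard(B, rand_mask(d_out, r, seed=2 * s))
        Am = hadamard(A, rand_mask(r, d_in, seed=2 * s + 1))
        total = add(total, matmul(Bm, Am))
    return total


d, r, k = 8, 2, 3                      # layer width, factor rank, parallel modules
rng = random.Random(0)
B = rand_matrix(d, r, rng)             # shared factor, d x r
A = rand_matrix(r, d, rng)             # shared factor, r x d

plain = matmul(B, A)                   # single low-rank update: rank at most r
boosted = masked_sum_update(B, A, seeds=range(k))  # rank at most min(d, k * r)

trainable = d * r + r * d              # same count for both variants
print("trainable parameters:", trainable)
print("rank of plain update:", rank(plain))
print("rank of masked-sum update:", rank(boosted))
```

In this sketch the single product `B @ A` can never exceed rank `r`, while the sum of `k` differently masked products is only bounded by `min(d, k * r)`. Both variants train the same `B` and `A`, and the masks are regenerated from integer seeds, which is one way to read the abstract's claim of higher rank without extra parameters or storage.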

Comments:	Accepted at ICLR 2024. Code at this https URL
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2407.01320 [cs.LG]
	(or arXiv:2407.01320v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.01320

Submission history

From: Hao Zhao
[v1] Mon, 1 Jul 2024 14:26:48 UTC (292 KB)
