Activated LoRA: Fine-tuned LLMs for Intrinsics

Greenewald, Kristjan; Lastras, Luis; Parnell, Thomas; Shah, Vraj; Popa, Lucian; Zizzo, Giulio; Gunasekara, Chulaka; Rawat, Ambrish; Cox, David

Computer Science > Machine Learning

arXiv:2504.12397 (cs)

[Submitted on 16 Apr 2025]

Title:Activated LoRA: Fine-tuned LLMs for Intrinsics

Authors:Kristjan Greenewald, Luis Lastras, Thomas Parnell, Vraj Shah, Lucian Popa, Giulio Zizzo, Chulaka Gunasekara, Ambrish Rawat, David Cox

View PDF HTML (experimental)

Abstract:Low-Rank Adaptation (LoRA) has emerged as a highly efficient framework for finetuning the weights of large foundation models, and has become the go-to method for data-driven customization of LLMs. Despite the promise of highly customized behaviors and capabilities, switching between relevant LoRAs in a multiturn setting is highly inefficient, as the key-value (KV) cache of the entire turn history must be recomputed with the LoRA weights before generation can begin. To address this problem, we propose Activated LoRA (aLoRA), which modifies the LoRA framework to only adapt weights for the tokens in the sequence \emph{after} the aLoRA is invoked. This change crucially allows aLoRA to accept the base model's KV cache of the input string, meaning that aLoRA can be instantly activated whenever needed in a chain without recomputing the cache. This enables building what we call \emph{intrinsics}, i.e. highly specialized models invoked to perform well-defined operations on portions of an input chain or conversation that otherwise uses the base model by default. We use aLoRA to train a set of intrinsics models, demonstrating competitive accuracy with standard LoRA while achieving significant inference benefits.

Comments:	arXiv admin note: text overlap with arXiv:2504.11704
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2504.12397 [cs.LG]
	(or arXiv:2504.12397v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2504.12397

Submission history

From: Kristjan Greenewald [view email]
[v1] Wed, 16 Apr 2025 18:03:21 UTC (236 KB)

Computer Science > Machine Learning

Title:Activated LoRA: Fine-tuned LLMs for Intrinsics

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Activated LoRA: Fine-tuned LLMs for Intrinsics

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators