ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion

Khan, Rana Muhammad Shahroz; Tang, Dongwen; Li, Pingzhi; Wang, Kai; Chen, Tianlong

Computer Science > Machine Learning

arXiv:2503.24354 (cs)

[Submitted on 31 Mar 2025 (v1), last revised 8 Apr 2025 (this version, v2)]

Title:ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion

Authors:Rana Muhammad Shahroz Khan, Dongwen Tang, Pingzhi Li, Kai Wang, Tianlong Chen

View PDF HTML (experimental)

Abstract:Parameter generation has emerged as a novel paradigm for neural network development, offering an alternative to traditional neural network training by synthesizing high-quality model weights directly. In the context of Low-Rank Adaptation (LoRA) for evolving ($\textit{i.e.}$, constantly updated) large language models (LLMs), this approach promises efficient adaptation without costly retraining. However, existing methods face critical limitations in simultaneously achieving scalability and controllability. In this paper, we introduce $\texttt{ORAL}$, a novel $\textbf{conditional recurrent diffusion}$ framework that addresses these challenges. $\texttt{ORAL}$ incorporates a novel conditioning mechanism that integrates model architecture and textual task specifications, enabling the generation of task-specific LoRA parameters that can seamlessly transfer across evolving foundation models. Our approach successfully scales to billions-of-parameter LLMs and maintains controllability. Through extensive experiments across seven language tasks, four vision tasks, and three multimodal tasks using five pre-trained LLMs, we demonstrate that $\texttt{ORAL}$ generates high-quality LoRA parameters that achieve comparable or superior performance to vanilla trained counterparts.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2503.24354 [cs.LG]
	(or arXiv:2503.24354v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2503.24354

Submission history

From: Rana Muhammad Shahroz Khan [view email]
[v1] Mon, 31 Mar 2025 17:34:59 UTC (218 KB)
[v2] Tue, 8 Apr 2025 18:38:56 UTC (217 KB)

Computer Science > Machine Learning

Title:ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators