Resona: Improving Context Copying in Linear Recurrence Models with Retrieval

Wang, Xinyu; Ma, Linrui; Huang, Jerry; Lu, Peng; Parthasarathi, Prasanna; Chang, Xiao-Wen; Chen, Boxing; Cui, Yufei

Computer Science > Computation and Language

arXiv:2503.22913 (cs)

[Submitted on 28 Mar 2025]

Abstract: Recent shifts in the space of large language model (LLM) research have shown an increasing focus on novel architectures to compete with prototypical Transformer-based models that have long dominated this space. Linear recurrent models have proven to be a viable competitor due to their computational efficiency. However, such models still demonstrate a sizable gap compared to Transformers in terms of in-context learning among other tasks that require recalling information from a context. In this work, we introduce Resona, a simple and scalable framework for augmenting linear recurrent models with retrieval. Resona augments models with the ability to integrate retrieved information from the provided input context, enabling tailored behavior to diverse task requirements. Experiments on a variety of linear recurrent models demonstrate that Resona-augmented models observe significant performance gains on a variety of synthetic as well as real-world natural language tasks, highlighting its ability to act as a general purpose method to improve the in-context learning and language modeling abilities of linear recurrent LLMs.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2503.22913 [cs.CL]
	(or arXiv:2503.22913v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2503.22913

Submission history

From: Xinyu Wang
[v1] Fri, 28 Mar 2025 23:43:33 UTC (413 KB)
