EpMAN: Episodic Memory AttentioN for Generalizing to Longer Contexts

Chaudhury, Subhajit; Das, Payel; Swaminathan, Sarathkrishna; Kollias, Georgios; Nelson, Elliot; Pahwa, Khushbu; Pedapati, Tejaswini; Melnyk, Igor; Riemer, Matthew

arXiv:2502.14280 (cs)

[Submitted on 20 Feb 2025]

Abstract: Recent advances in Large Language Models (LLMs) have yielded impressive successes on many language tasks. However, efficient processing of long contexts using LLMs remains a significant challenge. We introduce EpMAN, a method for processing long contexts in an episodic memory module while holistically attending to semantically relevant context chunks. The output of episodic attention is then used to reweigh the decoder's self-attention to the stored KV cache of the context during training and generation. When an LLM decoder is trained using EpMAN, its performance on multiple challenging single-hop long-context recall and question-answering benchmarks is found to be stronger and more robust across the range from 16k to 256k tokens than that of baseline decoders trained with self-attention and that of popular retrieval-augmented generation frameworks.
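
The abstract does not spell out how the episodic attention output reweighs the decoder's self-attention, so the following is only a minimal sketch of one possible reading, not the authors' implementation. It assumes each cached token belongs to a context chunk, each chunk has a scalar relevance score, and each token's softmax attention weight is multiplied by its chunk's score. The function name `chunk_reweighted_attention` and the parameters `chunk_ids` and `chunk_scores` are hypothetical names introduced for illustration.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def chunk_reweighted_attention(query, keys, values, chunk_ids, chunk_scores):
    """Single-query attention with chunk-level reweighting (illustrative assumption).

    query:        list[float], one decoder query vector
    keys, values: list[list[float]], cached K/V vectors, one per context token
    chunk_ids:    list[int], index of the chunk each token belongs to
    chunk_scores: list[float], assumed relevance score per chunk
    """
    scale = 1.0 / math.sqrt(len(query))
    # Ordinary scaled dot-product attention weights over all cached tokens.
    token_weights = softmax([dot(query, k) * scale for k in keys])
    # Assumption: scale each token's weight by the score of its chunk.
    reweighted = [w * chunk_scores[c] for w, c in zip(token_weights, chunk_ids)]
    # Weighted sum of value vectors (this sketch does not renormalize).
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(reweighted, values)) for d in range(dim)]


# Toy usage: two chunks of two tokens each; only chunk 0 is deemed relevant.
out = chunk_reweighted_attention(
    query=[1.0, 0.0],
    keys=[[1.0, 0.0]] * 4,
    values=[[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0]],
    chunk_ids=[0, 0, 1, 1],
    chunk_scores=[1.0, 0.0],
)
```

In this toy case the tokens of the zero-scored chunk contribute nothing to the output, which is the intuition behind attending only to semantically relevant chunks. How the scores are actually computed and applied in EpMAN is described in the paper itself.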

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.14280 [cs.CL]
	(or arXiv:2502.14280v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.14280

Submission history

From: Subhajit Chaudhury
[v1] Thu, 20 Feb 2025 05:41:15 UTC (312 KB)
