DENIAHL: In-Context Features Influence LLM Needle-In-A-Haystack Abilities

Dai, Hui; Pechi, Dan; Yang, Xinyi; Banga, Garvit; Mantri, Raghav

arXiv:2411.19360 (cs)

[Submitted on 28 Nov 2024]

Abstract: The Needle-in-a-haystack (NIAH) test is a general task used to assess language models' (LMs') ability to recall particular information from a long input context. This framework, however, does not provide a means of analyzing which factors, beyond context length, contribute to LMs' ability or inability to separate and recall needles from their haystacks. To provide a systematic means of assessing which features contribute to LMs' NIAH capabilities, we developed a synthetic benchmark called DENIAHL (Data-oriented Evaluation of NIAH for LLM's). Our work expands on previous NIAH studies by ablating NIAH features beyond the typical context length, including data type, size, and patterns. We find stark differences between the performance of GPT-3.5 and LLaMA 2-7B on DENIAHL, as well as drops in recall performance when features like item size are increased and, to some degree, when the data type is changed from numbers to letters. This has implications for increasingly large context models, demonstrating that factors beyond item number impact NIAH capabilities.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2411.19360 [cs.CL]
	(or arXiv:2411.19360v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2411.19360

Submission history

From: Hui Dai
[v1] Thu, 28 Nov 2024 20:14:47 UTC (9,098 KB)
