LLMs can be easily Confused by Instructional Distractions

Hwang, Yerin; Kim, Yongil; Koo, Jahyun; Kang, Taegwan; Bae, Hyunkyung; Jung, Kyomin

Computer Science > Computation and Language

arXiv:2502.04362 (cs)

[Submitted on 5 Feb 2025]

Title:LLMs can be easily Confused by Instructional Distractions

Authors:Yerin Hwang, Yongil Kim, Jahyun Koo, Taegwan Kang, Hyunkyung Bae, Kyomin Jung

View PDF HTML (experimental)

Abstract:Despite the fact that large language models (LLMs) show exceptional skill in instruction following tasks, this strength can turn into a vulnerability when the models are required to disregard certain instructions. Instruction-following tasks typically involve a clear task description and input text containing the target data to be processed. However, when the input itself resembles an instruction, confusion may arise, even if there is explicit prompting to distinguish between the task instruction and the input. We refer to this phenomenon as instructional distraction. In this paper, we introduce a novel benchmark, named DIM-Bench, specifically designed to assess LLMs' performance under instructional distraction. The benchmark categorizes real-world instances of instructional distraction and evaluates LLMs across four instruction tasks: rewriting, proofreading, translation, and style transfer -- alongside five input tasks: reasoning, code generation, mathematical reasoning, bias detection, and question answering. Our experimental results reveal that even the most advanced LLMs are susceptible to instructional distraction, often failing to accurately follow user intent in such cases.

Comments:	8 pages
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.04362 [cs.CL]
	(or arXiv:2502.04362v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.04362

Submission history

From: Yerin Hwang [view email]
[v1] Wed, 5 Feb 2025 04:52:57 UTC (9,009 KB)

Computer Science > Computation and Language

Title:LLMs can be easily Confused by Instructional Distractions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:LLMs can be easily Confused by Instructional Distractions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators