Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models

Moon, Hyeonseok; Seo, Jaehyung; Lee, Seungyoon; Park, Chanjun; Lim, Heuiseok

Computer Science > Artificial Intelligence

arXiv:2412.19450 (cs)

[Submitted on 27 Dec 2024]

Title:Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models

Authors:Hyeonseok Moon, Jaehyung Seo, Seungyoon Lee, Chanjun Park, Heuiseok Lim

View PDF HTML (experimental)

Abstract:One of the key strengths of Large Language Models (LLMs) is their ability to interact with humans by generating appropriate responses to given instructions. This ability, known as instruction-following capability, has established a foundation for the use of LLMs across various fields and serves as a crucial metric for evaluating their performance. While numerous evaluation benchmarks have been developed, most focus solely on clear and coherent instructions. However, we have noted that LLMs can become easily distracted by instruction-formatted statements, which may lead to an oversight of their instruction comprehension skills. To address this issue, we introduce the Intention of Instruction (IoInst) benchmark. This benchmark evaluates LLMs' capacity to remain focused and understand instructions without being misled by extraneous instructions. The primary objective of this benchmark is to identify the appropriate instruction that accurately guides the generation of a given context. Our findings suggest that even recently introduced state-of-the-art models still lack instruction understanding capability. Along with the proposition of IoInst in this study, we also present broad analyses of the several strategies potentially applicable to IoInst.

Comments:	21 pages
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2412.19450 [cs.AI]
	(or arXiv:2412.19450v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2412.19450

Submission history

From: Hyeonseok Moon [view email]
[v1] Fri, 27 Dec 2024 04:37:39 UTC (385 KB)

Computer Science > Artificial Intelligence

Title:Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators