The Mirage of Model Editing: Revisiting Evaluation in the Wild

Yang, Wanli; Sun, Fei; Tan, Jiajun; Ma, Xinyu; Cao, Qi; Yin, Dawei; Shen, Huawei; Cheng, Xueqi

arXiv:2502.11177 (cs)

[Submitted on 16 Feb 2025 (v1), last revised 18 Feb 2025 (this version, v2)]

Abstract: Despite near-perfect results in artificial evaluations, the effectiveness of model editing in real-world applications remains unexplored. To bridge this gap, we propose to study model editing in question answering (QA) by establishing a rigorous evaluation practice to assess the effectiveness of editing methods in correcting LLMs' errors. It consists of QAEdit, a new benchmark derived from popular QA datasets, and a standardized evaluation framework. Our single editing experiments indicate that current editing methods perform substantially worse than previously reported (38.5% vs. ~96%). Through module analysis and controlled experiments, we demonstrate that this performance decline stems from issues in the evaluation practices of prior editing research. One key issue is the inappropriate use of teacher forcing in testing, which prevents error propagation by feeding ground truth tokens (inaccessible in real-world scenarios) as input. Furthermore, we simulate real-world deployment by sequential editing, revealing that current approaches fail drastically with only 1000 edits. Our analysis provides a fundamental reexamination of both the real-world applicability of existing model editing methods and their evaluation practices, and establishes a rigorous evaluation framework with key insights to advance reliable and practical model editing research.
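The teacher-forcing issue named in the abstract can be illustrated with a minimal sketch. This is not the paper's evaluation framework: `toy_model`, its lookup table, `PROMPT`, and `TARGET` are hypothetical stand-ins chosen only to show how feeding ground truth tokens at test time can hide an error that would propagate in free-running generation.

```python
from typing import Callable, List, Sequence

Token = str
NextTokenFn = Callable[[Sequence[Token]], Token]


def toy_model(context: Sequence[Token]) -> Token:
    """Hypothetical stand-in for an edited LLM.

    It predicts the next token from the last context token only.
    The first prediction is deliberately wrong ("Lyon" instead of "Paris").
    """
    table = {
        "<q>": "Lyon",   # wrong first token
        "Paris": ",",
        ",": "France",
        "Lyon": "is",
        "is": "nice",
    }
    return table.get(context[-1], "<eos>")


def teacher_forced_accuracy(
    model: NextTokenFn, prompt: Sequence[Token], target: Sequence[Token]
) -> float:
    """Token accuracy when every step is conditioned on the gold prefix."""
    correct = 0
    for i, gold in enumerate(target):
        # The ground truth prefix target[:i] is fed as input, so an earlier
        # mistake by the model never reaches later steps.
        context = list(prompt) + list(target[:i])
        if model(context) == gold:
            correct += 1
    return correct / len(target)


def free_running_generate(
    model: NextTokenFn, prompt: Sequence[Token], max_new_tokens: int
) -> List[Token]:
    """Greedy generation where the model conditions on its own outputs."""
    context = list(prompt)
    output: List[Token] = []
    for _ in range(max_new_tokens):
        token = model(context)
        if token == "<eos>":
            break
        output.append(token)
        context.append(token)  # errors propagate from here on
    return output


PROMPT = ["<q>"]
TARGET = ["Paris", ",", "France"]

tf_accuracy = teacher_forced_accuracy(toy_model, PROMPT, TARGET)
generated = free_running_generate(toy_model, PROMPT, max_new_tokens=len(TARGET))
exact_match = generated == TARGET
```

In this toy setting the teacher-forced score counts two of the three target tokens as correct, while the free-running output diverges after the first wrong token and never matches the target. The gap is of the same kind the abstract attributes to teacher forcing, though the numbers here are illustrative only.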

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2502.11177 [cs.CL]
	(or arXiv:2502.11177v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.11177

Submission history

From: Wanli Yang
[v1] Sun, 16 Feb 2025 15:57:55 UTC (157 KB)
[v2] Tue, 18 Feb 2025 12:31:49 UTC (158 KB)
