CoME: An Unlearning-based Approach to Conflict-free Model Editing

Jung, Dahyun; Seo, Jaehyung; Lee, Jaewook; Park, Chanjun; Lim, Heuiseok

Computer Science > Computation and Language

arXiv:2502.15826 (cs)

[Submitted on 20 Feb 2025]

Title:CoME: An Unlearning-based Approach to Conflict-free Model Editing

Authors:Dahyun Jung, Jaehyung Seo, Jaewook Lee, Chanjun Park, Heuiseok Lim

View PDF HTML (experimental)

Abstract:Large language models (LLMs) often retain outdated or incorrect information from pre-training, which undermines their reliability. While model editing methods have been developed to address such errors without full re-training, they frequently suffer from knowledge conflicts, where outdated information interferes with new knowledge. In this work, we propose Conflict-free Model Editing (CoME), a novel framework that enhances the accuracy of knowledge updates in LLMs by selectively removing outdated knowledge. CoME leverages unlearning to mitigate knowledge interference, allowing new information to be integrated without compromising relevant linguistic features. Through experiments on GPT-J and LLaMA-3 using Counterfact and ZsRE datasets, we demonstrate that CoME improves both editing accuracy and model reliability when applied to existing editing methods. Our results highlight that the targeted removal of outdated knowledge is crucial for enhancing model editing effectiveness and maintaining the model's generative performance.

Comments:	Accepted to NAACL 2025 main conference
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.15826 [cs.CL]
	(or arXiv:2502.15826v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.15826

Submission history

From: Dahyun Jung [view email]
[v1] Thu, 20 Feb 2025 04:55:38 UTC (905 KB)

Computer Science > Computation and Language

Title:CoME: An Unlearning-based Approach to Conflict-free Model Editing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:CoME: An Unlearning-based Approach to Conflict-free Model Editing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators