Verifying Robust Unlearning: Probing Residual Knowledge in Unlearned Models

Xuan, Hao; Li, Xingyu

Computer Science > Machine Learning

arXiv:2504.14798 (cs)

[Submitted on 21 Apr 2025]

Abstract: Machine Unlearning (MUL) is crucial for privacy protection and content regulation, yet recent studies reveal that traces of forgotten information persist in unlearned models, enabling adversaries to resurface removed knowledge. Existing verification methods only confirm whether unlearning was executed and fail to detect such residual information leaks. To address this, we introduce the concept of Robust Unlearning, which requires that an unlearned model be indistinguishable from a retrained model and resistant to adversarial recovery. To empirically evaluate whether unlearning techniques meet this security standard, we propose the Unlearning Mapping Attack (UMA), a post-unlearning verification framework that actively probes models for forgotten traces using adversarial queries. Extensive experiments on discriminative and generative tasks show that existing unlearning techniques remain vulnerable even when they pass existing verification metrics. By establishing UMA as a practical verification tool, this study sets a new standard for assessing and enhancing machine unlearning security.

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2504.14798 [cs.LG]
	(or arXiv:2504.14798v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2504.14798

Submission history

From: Hao Xuan
[v1] Mon, 21 Apr 2025 01:56:15 UTC (23,673 KB)
