BridgePure: Revealing the Fragility of Black-box Data Protection

Wang, Yihan; Lu, Yiwei; Gao, Xiao-Shan; Kamath, Gautam; Yu, Yaoliang

arXiv:2412.21061 (cs)

[Submitted on 30 Dec 2024]

Abstract: Availability attacks, or unlearnable examples, are defensive techniques that let data owners modify their datasets so that unauthorized machine learning models cannot learn from them effectively, while the data retains its intended functionality. These techniques have led to the release of popular black-box tools that let users upload personal data and receive protected counterparts. In this work, we show that such black-box protections can be substantially bypassed if a small set of unprotected in-distribution data is available. Specifically, an adversary can (1) easily acquire (unprotected, protected) pairs by querying the black-box protection with the unprotected dataset; and (2) train a diffusion bridge model to learn a mapping from protected data back to unprotected data. This mapping, termed BridgePure, can effectively remove the protection from any previously unseen data within the same distribution. Under this threat model, our method demonstrates superior purification performance on classification and style mimicry tasks, exposing critical vulnerabilities in black-box data protection.
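
The two-step threat model in the abstract can be illustrated with a toy sketch. Everything below is an assumption made for illustration: `black_box_protect` is a hypothetical stand-in for a protection service (it adds a fixed hidden offset), and `fit_offset_purifier` replaces the paper's diffusion bridge model with a simple per-coordinate average. It shows only the data flow (query, collect pairs, fit a mapping, purify unseen data), not the authors' method.

```python
import random


def black_box_protect(x):
    """Hypothetical protection service: adds a fixed perturbation hidden from the adversary."""
    hidden_delta = [0.5, -0.25, 0.125, 0.0]
    return [xi + di for xi, di in zip(x, hidden_delta)]


def collect_pairs(unprotected, protect):
    """Step (1): query the black box to obtain (unprotected, protected) pairs."""
    return [(x, protect(x)) for x in unprotected]


def fit_offset_purifier(pairs):
    """Step (2), heavily simplified: estimate the mean per-coordinate perturbation.

    The paper trains a diffusion bridge model at this step; the average used
    here is only a placeholder that works because the toy perturbation is fixed.
    """
    dim = len(pairs[0][0])
    n = len(pairs)
    mean_delta = [sum(p[i] - x[i] for x, p in pairs) / n for i in range(dim)]

    def purify(protected):
        # Map a protected sample back toward its unprotected version.
        return [pi - di for pi, di in zip(protected, mean_delta)]

    return purify


rng = random.Random(0)

# Small set of unprotected in-distribution data held by the adversary.
adversary_data = [[rng.random() for _ in range(4)] for _ in range(16)]
pairs = collect_pairs(adversary_data, black_box_protect)
purify = fit_offset_purifier(pairs)

# Previously unseen sample from the same distribution, seen only in protected form.
unseen = [rng.random() for _ in range(4)]
recovered = purify(black_box_protect(unseen))
max_error = max(abs(r - u) for r, u in zip(recovered, unseen))
print(max_error)
```

In this toy setting the recovered sample matches the original up to floating-point error because the perturbation is identical for every input. Real protections are input-dependent, which is why the abstract calls for a learned mapping rather than a fixed offset.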

Comments:	26 pages, 13 figures
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2412.21061 [cs.LG]
	(or arXiv:2412.21061v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.21061

Submission history

From: Yihan Wang
[v1] Mon, 30 Dec 2024 16:30:50 UTC (36,922 KB)
