Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks

Nguyen, Quang H.; Ngoc-Hieu, Nguyen; Ta, The-Anh; Nguyen-Tang, Thanh; Wong, Kok-Seng; Thanh-Tung, Hoang; Doan, Khoa D.

arXiv:2407.10825 (cs)

[Submitted on 15 Jul 2024 (v1), last revised 16 Jul 2024 (this version, v2)]

Abstract: Deep neural networks are vulnerable to backdoor attacks, a type of adversarial attack that poisons the training data to manipulate the behavior of models trained on it. Clean-label attacks are a stealthier form of backdoor attack that leaves the labels of the poisoned data unchanged. Early work on clean-label attacks added triggers to a random subset of the training set, ignoring the fact that samples contribute unequally to the attack's success. As a result, these attacks need high poisoning rates yet achieve low attack success rates. To alleviate the problem, several supervised learning-based sample selection strategies have been proposed. However, these methods assume access to the entire labeled training set and require training, which is expensive and may not always be practical. This work studies a new and more practical (but also more challenging) threat model in which the attacker provides data only for the target class (e.g., in face recognition systems) and has no knowledge of the victim model or of any other classes in the training set. We study different strategies for selectively poisoning a small set of training samples in the target class to boost the attack success rate in this setting. This threat model is a serious concern when training machine learning models on third-party datasets, since the attack can be performed effectively with limited information. Experiments on benchmark datasets illustrate the effectiveness of our strategies in improving clean-label backdoor attacks.

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.10825 [cs.LG]
	(or arXiv:2407.10825v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.10825

Submission history

From: Quang Nguyen
[v1] Mon, 15 Jul 2024 15:38:21 UTC (2,502 KB)
[v2] Tue, 16 Jul 2024 04:21:12 UTC (2,502 KB)
