Deep k-NN Defense against Clean-label Data Poisoning Attacks

Peri, Neehar; Gupta, Neal; Huang, W. Ronny; Fowl, Liam; Zhu, Chen; Feizi, Soheil; Goldstein, Tom; Dickerson, John P.

Computer Science > Machine Learning

arXiv:1909.13374 (cs)

[Submitted on 29 Sep 2019 (v1), last revised 13 Aug 2020 (this version, v3)]

Title:Deep k-NN Defense against Clean-label Data Poisoning Attacks

Authors:Neehar Peri, Neal Gupta, W. Ronny Huang, Liam Fowl, Chen Zhu, Soheil Feizi, Tom Goldstein, John P. Dickerson

View PDF

Abstract:Targeted clean-label data poisoning is a type of adversarial attack on machine learning systems in which an adversary injects a few correctly-labeled, minimally-perturbed samples into the training data, causing a model to misclassify a particular test sample during inference. Although defenses have been proposed for general poisoning attacks, no reliable defense for clean-label attacks has been demonstrated, despite the attacks' effectiveness and realistic applications. In this work, we propose a simple, yet highly-effective Deep k-NN defense against both feature collision and convex polytope clean-label attacks on the CIFAR-10 dataset. We demonstrate that our proposed strategy is able to detect over 99% of poisoned examples in both attacks and remove them without compromising model performance. Additionally, through ablation studies, we discover simple guidelines for selecting the value of k as well as for implementing the Deep k-NN defense on real-world datasets with class imbalance. Our proposed defense shows that current clean-label poisoning attack strategies can be annulled, and serves as a strong yet simple-to-implement baseline defense to test future clean-label poisoning attacks. Our code is available at this https URL

Comments:	Accepted to ECCV 2020 Workshop - Adversarial Robustness in the Real World (AROW). First three authors contributed equally
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:1909.13374 [cs.LG]
	(or arXiv:1909.13374v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1909.13374

Submission history

From: Neehar Peri [view email]
[v1] Sun, 29 Sep 2019 21:47:14 UTC (1,564 KB)
[v2] Fri, 6 Mar 2020 02:38:02 UTC (1,312 KB)
[v3] Thu, 13 Aug 2020 05:47:23 UTC (1,508 KB)

Computer Science > Machine Learning

Title:Deep k-NN Defense against Clean-label Data Poisoning Attacks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Deep k-NN Defense against Clean-label Data Poisoning Attacks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators