Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks

Schwarzschild, Avi; Goldblum, Micah; Gupta, Arjun; Dickerson, John P; Goldstein, Tom

Computer Science > Machine Learning

arXiv:2006.12557 (cs)

[Submitted on 22 Jun 2020 (v1), last revised 17 Jun 2021 (this version, v3)]

Title:Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks

Authors:Avi Schwarzschild, Micah Goldblum, Arjun Gupta, John P Dickerson, Tom Goldstein

View PDF

Abstract:Data poisoning and backdoor attacks manipulate training data in order to cause models to fail during inference. A recent survey of industry practitioners found that data poisoning is the number one concern among threats ranging from model stealing to adversarial attacks. However, it remains unclear exactly how dangerous poisoning methods are and which ones are more effective considering that these methods, even ones with identical objectives, have not been tested in consistent or realistic settings. We observe that data poisoning and backdoor attacks are highly sensitive to variations in the testing setup. Moreover, we find that existing methods may not generalize to realistic settings. While these existing works serve as valuable prototypes for data poisoning, we apply rigorous tests to determine the extent to which we should fear them. In order to promote fair comparison in future work, we develop standardized benchmarks for data poisoning and backdoor attacks.

Comments:	19 pages, 4 figures
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV); Computers and Society (cs.CY); Machine Learning (stat.ML)
Cite as:	arXiv:2006.12557 [cs.LG]
	(or arXiv:2006.12557v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2006.12557

Submission history

From: Avi Schwarzschild [view email]
[v1] Mon, 22 Jun 2020 18:34:08 UTC (169 KB)
[v2] Mon, 2 Nov 2020 21:59:59 UTC (197 KB)
[v3] Thu, 17 Jun 2021 14:10:57 UTC (174 KB)

Computer Science > Machine Learning

Title:Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Just How Toxic is Data Poisoning? A Unified Benchmark for Backdoor and Data Poisoning Attacks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators