Classification Auto-Encoder based Detector against Diverse Data Poisoning Attacks

Razmi, Fereshteh; Xiong, Li

Computer Science > Machine Learning

arXiv:2108.04206 (cs)

[Submitted on 9 Aug 2021 (v1), last revised 16 May 2022 (this version, v2)]

Title:Classification Auto-Encoder based Detector against Diverse Data Poisoning Attacks

Authors:Fereshteh Razmi, Li Xiong

View PDF

Abstract:Poisoning attacks are a category of adversarial machine learning threats in which an adversary attempts to subvert the outcome of the machine learning systems by injecting crafted data into training data set, thus increasing the machine learning model's test error. The adversary can tamper with the data feature space, data labels, or both, each leading to a different attack strategy with different strengths. Various detection approaches have recently emerged, each focusing on one attack strategy. The Achilles heel of many of these detection approaches is their dependence on having access to a clean, untampered data set. In this paper, we propose CAE, a Classification Auto-Encoder based detector against diverse poisoned data. CAE can detect all forms of poisoning attacks using a combination of reconstruction and classification errors without having any prior knowledge of the attack strategy. We show that an enhanced version of CAE (called CAE+) does not have to employ a clean data set to train the defense model. Our experimental results on three real datasets MNIST, Fashion-MNIST and CIFAR demonstrate that our proposed method can maintain its functionality under up to 30% contaminated data and help the defended SVM classifier to regain its best accuracy.

Comments:	This work has been submitted to the IEEE for possible publication
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2108.04206 [cs.LG]
	(or arXiv:2108.04206v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2108.04206

Submission history

From: Fereshteh Razmi [view email]
[v1] Mon, 9 Aug 2021 17:46:52 UTC (3,814 KB)
[v2] Mon, 16 May 2022 20:11:45 UTC (2,491 KB)

Computer Science > Machine Learning

Title:Classification Auto-Encoder based Detector against Diverse Data Poisoning Attacks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Classification Auto-Encoder based Detector against Diverse Data Poisoning Attacks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators