Active Learning Under Malicious Mislabeling and Poisoning Attacks

Lin, Jing; Luley, Ryan; Xiong, Kaiqi

arXiv:2101.00157v2 (cs)

[Submitted on 1 Jan 2021 (v1), revised 24 Mar 2021 (this version, v2), latest version 2 Sep 2021 (v4)]

Abstract: Deep neural networks usually require large labeled datasets to achieve state-of-the-art performance in many tasks, such as image classification and natural language processing. Although active Internet users create a large amount of data each day through distributed systems across the world, most of these data are unlabeled and vulnerable to data poisoning attacks. In this paper, we develop an efficient active learning method that requires fewer labeled instances and incorporates adversarial retraining, in which additional labeled artificial data are generated without increasing the labeling budget. The generated adversarial examples also provide a way to measure the vulnerability of the model. To check the performance of the proposed method under an adversarial setting, i.e., malicious mislabeling and data poisoning attacks, we perform an extensive evaluation, run on a private campus cloud, on a reduced CIFAR-10 dataset that contains only two classes: 'airplane' and 'frog'. Our experimental results demonstrate that the proposed active learning method defends efficiently against malicious mislabeling and data poisoning attacks. Specifically, whereas the baseline active learning method based on the random sampling strategy performs poorly (about 50% accuracy) under a malicious mislabeling attack, the proposed active learning method achieves the desired accuracy of 89% using only one-third of the dataset on average.
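
The interplay between the labeling budget and adversarial retraining can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract names neither the query strategy nor the attack used to generate adversarial examples, so uncertainty sampling and an FGSM-style perturbation are assumptions here, and a logistic regression on toy 2-D points stands in for a deep network on the reduced CIFAR-10 data. All function names (`train`, `adversarial_example`, `active_learn`) are hypothetical.

```python
import math
import random

def margin(w, b, x):
    """Signed score of a linear classifier; its sign is the predicted label."""
    return w[0] * x[0] + w[1] * x[1] + b

def train(data, epochs=100, lr=0.1):
    """Fit logistic regression by stochastic gradient descent; labels are -1 or +1."""
    w, b = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for x, y in data:
            z = max(-30.0, min(30.0, y * margin(w, b, x)))  # clamp to keep exp() finite
            g = -y / (1.0 + math.exp(z))  # d/d(margin) of log(1 + exp(-y * margin))
            w = [w[0] - lr * g * x[0], w[1] - lr * g * x[1]]
            b -= lr * g
    return w, b

def sign(v):
    return (v > 0) - (v < 0)

def adversarial_example(w, x, y, eps):
    """FGSM-style step (assumed attack): shift each coordinate by eps to raise the loss."""
    return tuple(x[i] - eps * y * sign(w[i]) for i in range(2))

def active_learn(pool, oracle, seed_set, budget, eps=0.3):
    """Pool-based loop: one oracle query and one free adversarial copy per round."""
    labeled, unlabeled, queries = list(seed_set), list(pool), 0
    w, b = train(labeled)
    for _ in range(budget):
        # Uncertainty sampling (assumed strategy): query the point nearest the boundary.
        x = min(unlabeled, key=lambda p: abs(margin(w, b, p)))
        unlabeled.remove(x)
        y = oracle(x)
        queries += 1
        labeled.append((x, y))
        # The perturbed copy reuses y, so it costs no additional label.
        labeled.append((adversarial_example(w, x, y, eps), y))
        w, b = train(labeled)
    return w, b, labeled, queries

# Toy stand-in for the two-class data: two Gaussian blobs in the plane.
rng = random.Random(0)
truth = {}
for label, center in ((-1, -2.0), (1, 2.0)):
    for _ in range(30):
        truth[(rng.gauss(center, 1.0), rng.gauss(center, 1.0))] = label
points = list(truth)
seed_points = (points[0], points[1], points[30], points[31])
seed_set = [(p, truth[p]) for p in seed_points]
pool = [p for p in points if p not in seed_points]

w, b, labeled, queries = active_learn(pool, truth.__getitem__, seed_set, budget=10)
accuracy = sum(sign(margin(w, b, p)) == truth[p] for p in points) / len(points)
print(f"queries={queries} training_examples={len(labeled)} accuracy={accuracy:.2f}")
```

Each round adds two training examples but issues only one oracle query, which is the sense in which adversarial retraining leaves the labeling budget unchanged. A malicious mislabeling attack could be simulated in this sketch by passing an `oracle` that returns the flipped label for some of its answers.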

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as:	arXiv:2101.00157 [cs.LG]
	(or arXiv:2101.00157v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2101.00157

Submission history

From: Jing Lin
[v1] Fri, 1 Jan 2021 03:43:36 UTC (1,368 KB)
[v2] Wed, 24 Mar 2021 01:07:29 UTC (1,383 KB)
[v3] Sun, 16 May 2021 20:06:13 UTC (1,378 KB)
[v4] Thu, 2 Sep 2021 04:12:13 UTC (962 KB)