Fine-Pruning: Defending Against Backdooring Attacks on Deep Neural Networks

Liu, Kang; Dolan-Gavitt, Brendan; Garg, Siddharth

Computer Science > Cryptography and Security

arXiv:1805.12185 (cs)

[Submitted on 30 May 2018]

Title:Fine-Pruning: Defending Against Backdooring Attacks on Deep Neural Networks

Authors:Kang Liu, Brendan Dolan-Gavitt, Siddharth Garg

View PDF

Abstract:Deep neural networks (DNNs) provide excellent performance across a wide range of classification tasks, but their training requires high computational resources and is often outsourced to third parties. Recent work has shown that outsourced training introduces the risk that a malicious trainer will return a backdoored DNN that behaves normally on most inputs but causes targeted misclassifications or degrades the accuracy of the network when a trigger known only to the attacker is present. In this paper, we provide the first effective defenses against backdoor attacks on DNNs. We implement three backdoor attacks from prior work and use them to investigate two promising defenses, pruning and fine-tuning. We show that neither, by itself, is sufficient to defend against sophisticated attackers. We then evaluate fine-pruning, a combination of pruning and fine-tuning, and show that it successfully weakens or even eliminates the backdoors, i.e., in some cases reducing the attack success rate to 0% with only a 0.4% drop in accuracy for clean (non-triggering) inputs. Our work provides the first step toward defenses against backdoor attacks in deep neural networks.

Subjects:	Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:1805.12185 [cs.CR]
	(or arXiv:1805.12185v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.1805.12185

Submission history

From: Kang Liu [view email]
[v1] Wed, 30 May 2018 19:13:00 UTC (5,835 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CR

< prev | next >

new | recent | 2018-05

Change to browse by:

cs
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Kang Liu
Brendan Dolan-Gavitt
Siddharth Garg

export BibTeX citation

Computer Science > Cryptography and Security

Title:Fine-Pruning: Defending Against Backdooring Attacks on Deep Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:Fine-Pruning: Defending Against Backdooring Attacks on Deep Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators