Hidden Trigger Backdoor Attacks

Saha, Aniruddha; Subramanya, Akshayvarun; Pirsiavash, Hamed

arXiv:1910.00033 (cs)

[Submitted on 30 Sep 2019 (v1), last revised 21 Dec 2019 (this version, v2)]

Abstract: With the success of deep learning algorithms in various domains, studying adversarial attacks in order to secure deep models in real-world applications has become an important research topic. Backdoor attacks are a form of adversarial attack on deep networks in which the attacker provides poisoned data for the victim to train the model with, and then activates the attack by showing a specific small trigger pattern at test time. Most state-of-the-art backdoor attacks either provide mislabeled poisoning data that can be identified by visual inspection, reveal the trigger in the poisoned data, or use noise to hide the trigger. We propose a novel form of backdoor attack in which the poisoned data look natural and carry correct labels and, more importantly, the attacker hides the trigger in the poisoned data and keeps it secret until test time. We perform an extensive study on various image classification settings and show that our attack can fool the model by pasting the trigger at random locations on unseen images, even though the model performs well on clean data. We also show that our proposed attack cannot be easily defended against using a state-of-the-art defense algorithm for backdoor attacks.
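
The test-time step described in the abstract, pasting a small trigger at a random location on an unseen image, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function name `paste_trigger`, the 224x224 image size, and the 30x30 all-white patch are assumptions chosen only for illustration, and the sketch does not cover how the poisoned training data are generated.

```python
import numpy as np


def paste_trigger(image, trigger, rng):
    """Return a copy of `image` with `trigger` pasted at a random location.

    `image` and `trigger` are H x W x C arrays (hypothetical layout).
    The top-left corner is sampled uniformly so the patch always fits.
    """
    h, w = image.shape[:2]
    th, tw = trigger.shape[:2]
    if th > h or tw > w:
        raise ValueError("trigger must fit inside the image")

    # Sample a top-left corner; the upper bound is exclusive in rng.integers.
    top = int(rng.integers(0, h - th + 1))
    left = int(rng.integers(0, w - tw + 1))

    patched = image.copy()  # leave the clean image untouched
    patched[top:top + th, left:left + tw] = trigger
    return patched, (top, left)


# Illustrative inputs only: a black "image" and a white square "trigger".
rng = np.random.default_rng(0)
image = np.zeros((224, 224, 3), dtype=np.uint8)
trigger = np.full((30, 30, 3), 255, dtype=np.uint8)

patched, (top, left) = paste_trigger(image, trigger, rng)
```

In an evaluation of the kind the abstract describes, `patched` would be fed to the trained classifier while the clean `image` serves as the baseline; the classifier itself is outside the scope of this sketch.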

Comments:	AAAI 2020 - Main Technical Track (Oral)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1910.00033 [cs.CV]
	(or arXiv:1910.00033v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1910.00033

Submission history

From: Aniruddha Saha
[v1] Mon, 30 Sep 2019 18:03:28 UTC (1,585 KB)
[v2] Sat, 21 Dec 2019 02:13:34 UTC (1,248 KB)
