Defending against Adversarial Attacks through Resilient Feature Regeneration

Borkar, Tejas; Heide, Felix; Karam, Lina

Computer Science > Computer Vision and Pattern Recognition

arXiv:1906.03444v1 (cs)

[Submitted on 8 Jun 2019 (this version), latest version 11 Jun 2020 (v4)]

Title:Defending against Adversarial Attacks through Resilient Feature Regeneration

Authors:Tejas Borkar, Felix Heide, Lina Karam

View PDF

Abstract:Deep neural network (DNN) predictions have been shown to be vulnerable to carefully crafted adversarial perturbations. Specifically, so-called universal adversarial perturbations are image-agnostic perturbations that can be added to any image and can fool a target network into making erroneous predictions. Departing from existing adversarial defense strategies, which work in the image domain, we present a novel defense which operates in the DNN feature domain and effectively defends against such universal adversarial attacks. Our approach identifies pre-trained convolutional features that are most vulnerable to adversarial noise and deploys defender units which transform (regenerate) these DNN filter activations into noise-resilient features, guarding against unseen adversarial perturbations. The proposed defender units are trained using a target loss on synthetic adversarial perturbations, which we generate with a novel efficient synthesis method. We validate the proposed method for different DNN architectures, and demonstrate that it outperforms existing defense strategies across network architectures by more than 10% in restored accuracy. Moreover, we demonstrate that the approach also improves resilience of DNNs to other unseen adversarial attacks.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1906.03444 [cs.CV]
	(or arXiv:1906.03444v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1906.03444

Submission history

From: Tejas Borkar [view email]
[v1] Sat, 8 Jun 2019 12:18:13 UTC (6,695 KB)
[v2] Sat, 23 Nov 2019 09:53:52 UTC (7,835 KB)
[v3] Tue, 25 Feb 2020 06:41:51 UTC (6,321 KB)
[v4] Thu, 11 Jun 2020 02:40:33 UTC (6,321 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Defending against Adversarial Attacks through Resilient Feature Regeneration

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Defending against Adversarial Attacks through Resilient Feature Regeneration

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators