Detecting Adversarial Examples from Sensitivity Inconsistency of Spatial-Transform Domain

Tian, Jinyu; Zhou, Jiantao; Li, Yuanman; Duan, Jia


arXiv:2103.04302 (cs)

[Submitted on 7 Mar 2021]



Abstract: Deep neural networks (DNNs) have been shown to be vulnerable to adversarial examples (AEs), which are maliciously designed to cause dramatic model output errors. In this work, we reveal that normal examples (NEs) are insensitive to the fluctuations occurring at the highly curved regions of the decision boundary, while AEs, which are typically designed over a single domain (mostly the spatial domain), exhibit excessive sensitivity to such fluctuations. This phenomenon motivates us to design another classifier (called the dual classifier) with a transformed decision boundary, which can be used together with the original classifier (called the primal classifier) to detect AEs by virtue of the sensitivity inconsistency. Compared with the state-of-the-art algorithms based on Local Intrinsic Dimensionality (LID), Mahalanobis Distance (MD), and Feature Squeezing (FS), our proposed Sensitivity Inconsistency Detector (SID) achieves improved AE detection performance and superior generalization capabilities, especially in the challenging cases where the adversarial perturbation levels are small. Extensive experimental results on ResNet and VGG validate the superiority of the proposed SID.
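The primal/dual idea in the abstract can be illustrated with a toy sketch. This is not the authors' SID implementation: the abstract does not specify how the dual classifier is built or how the inconsistency is turned into a decision, so the sketch simply assumes that both classifiers emit logits for the same input and flags the input when their class-probability vectors disagree by more than a fixed threshold. The names `inconsistency`, `flag_adversarial`, the L1 distance, and the threshold value 0.5 are all hypothetical choices made for illustration.

```python
import math


def softmax(logits):
    """Convert a list of logits into class probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def inconsistency(primal_logits, dual_logits):
    """L1 distance between the primal and dual class-probability vectors.

    Assumption: a large distance stands in for the "sensitivity
    inconsistency" described in the abstract.
    """
    p = softmax(primal_logits)
    q = softmax(dual_logits)
    return sum(abs(a - b) for a, b in zip(p, q))


def flag_adversarial(primal_logits, dual_logits, threshold=0.5):
    """Flag an input when the two classifiers disagree strongly.

    The fixed threshold is a hypothetical stand-in for whatever
    decision rule the paper actually uses.
    """
    return inconsistency(primal_logits, dual_logits) > threshold


# Made-up logits: both classifiers agree on a normal-looking input ...
normal_flag = flag_adversarial([4.0, 0.5, 0.1], [3.8, 0.6, 0.2])
# ... but disagree on an input whose primal prediction has been flipped.
suspect_flag = flag_adversarial([0.4, 2.0, 0.1], [3.0, 0.5, 0.2])
```

With these made-up logits, the first pair scores well below the threshold and the second well above it. In practice the decision rule would have to be fitted on held-out normal and adversarial examples rather than fixed by hand.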

Subjects: Machine Learning (cs.LG); Cryptography and Security (cs.CR); Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2103.04302 [cs.LG] (or arXiv:2103.04302v1 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.2103.04302

Submission history

From: Jinyu Tian
[v1] Sun, 7 Mar 2021 08:43:22 UTC (886 KB)
