Classification regions of deep neural networks

Fawzi, Alhussein; Moosavi-Dezfooli, Seyed-Mohsen; Frossard, Pascal; Soatto, Stefano

Computer Science > Computer Vision and Pattern Recognition

arXiv:1705.09552 (cs)

[Submitted on 26 May 2017]

Title:Classification regions of deep neural networks

Authors:Alhussein Fawzi, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard, Stefano Soatto

View PDF

Abstract:The goal of this paper is to analyze the geometric properties of deep neural network classifiers in the input space. We specifically study the topology of classification regions created by deep networks, as well as their associated decision boundary. Through a systematic empirical investigation, we show that state-of-the-art deep nets learn connected classification regions, and that the decision boundary in the vicinity of datapoints is flat along most directions. We further draw an essential connection between two seemingly unrelated properties of deep networks: their sensitivity to additive perturbations in the inputs, and the curvature of their decision boundary. The directions where the decision boundary is curved in fact remarkably characterize the directions to which the classifier is the most vulnerable. We finally leverage a fundamental asymmetry in the curvature of the decision boundary of deep nets, and propose a method to discriminate between original images, and images perturbed with small adversarial examples. We show the effectiveness of this purely geometric approach for detecting small adversarial perturbations in images, and for recovering the labels of perturbed images.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1705.09552 [cs.CV]
	(or arXiv:1705.09552v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1705.09552

Submission history

From: Seyed-Mohsen Moosavi-Dezfooli [view email]
[v1] Fri, 26 May 2017 12:38:48 UTC (7,531 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Classification regions of deep neural networks

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Classification regions of deep neural networks

Submission history

Access Paper:

References & Citations

1 blog link

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators