Adversarial Examples Detection and Analysis with Layer-wise Autoencoders

Wójcik, Bartosz; Morawiecki, Paweł; Śmieja, Marek; Krzyżek, Tomasz; Spurek, Przemysław; Tabor, Jacek

Computer Science > Machine Learning

arXiv:2006.10013 (cs)

[Submitted on 17 Jun 2020]

Title:Adversarial Examples Detection and Analysis with Layer-wise Autoencoders

Authors:Bartosz Wójcik, Paweł Morawiecki, Marek Śmieja, Tomasz Krzyżek, Przemysław Spurek, Jacek Tabor

View PDF

Abstract:We present a mechanism for detecting adversarial examples based on data representations taken from the hidden layers of the target network. For this purpose, we train individual autoencoders at intermediate layers of the target network. This allows us to describe the manifold of true data and, in consequence, decide whether a given example has the same characteristics as true data. It also gives us insight into the behavior of adversarial examples and their flow through the layers of a deep neural network. Experimental results show that our method outperforms the state of the art in supervised and unsupervised settings.

Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR); Machine Learning (stat.ML)
Cite as:	arXiv:2006.10013 [cs.LG]
	(or arXiv:2006.10013v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2006.10013

Submission history

From: Bartosz Wójcik [view email]
[v1] Wed, 17 Jun 2020 17:17:54 UTC (9,248 KB)

Computer Science > Machine Learning

Title:Adversarial Examples Detection and Analysis with Layer-wise Autoencoders

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Adversarial Examples Detection and Analysis with Layer-wise Autoencoders

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators