Making Sense of CNNs: Interpreting Deep Representations & Their Invariances with INNs

Rombach, Robin; Esser, Patrick; Ommer, Björn

Computer Science > Computer Vision and Pattern Recognition

arXiv:2008.01777 (cs)

[Submitted on 4 Aug 2020]

Title:Making Sense of CNNs: Interpreting Deep Representations & Their Invariances with INNs

Authors:Robin Rombach, Patrick Esser, Björn Ommer

View PDF

Abstract:To tackle increasingly complex tasks, it has become an essential ability of neural networks to learn abstract representations. These task-specific representations and, particularly, the invariances they capture turn neural networks into black box models that lack interpretability. To open such a black box, it is, therefore, crucial to uncover the different semantic concepts a model has learned as well as those that it has learned to be invariant to. We present an approach based on INNs that (i) recovers the task-specific, learned invariances by disentangling the remaining factor of variation in the data and that (ii) invertibly transforms these recovered invariances combined with the model representation into an equally expressive one with accessible semantic concepts. As a consequence, neural network representations become understandable by providing the means to (i) expose their semantic meaning, (ii) semantically modify a representation, and (iii) visualize individual learned semantic concepts and invariances. Our invertible approach significantly extends the abilities to understand black box models by enabling post-hoc interpretations of state-of-the-art networks without compromising their performance. Our implementation is available at this https URL .

Comments:	ECCV 2020. Project page and code at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2008.01777 [cs.CV]
	(or arXiv:2008.01777v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2008.01777

Submission history

From: Patrick Esser [view email]
[v1] Tue, 4 Aug 2020 19:27:46 UTC (15,550 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Making Sense of CNNs: Interpreting Deep Representations & Their Invariances with INNs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Making Sense of CNNs: Interpreting Deep Representations & Their Invariances with INNs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators