Q-SENN: Quantized Self-Explaining Neural Networks

Norrenbrock, Thomas; Rudolph, Marco; Rosenhahn, Bodo

Computer Science > Computer Vision and Pattern Recognition

arXiv:2312.13839 (cs)

[Submitted on 21 Dec 2023 (v1), last revised 16 Feb 2024 (this version, v2)]

Title:Q-SENN: Quantized Self-Explaining Neural Networks

Authors:Thomas Norrenbrock, Marco Rudolph, Bodo Rosenhahn

View PDF

Abstract:Explanations in Computer Vision are often desired, but most Deep Neural Networks can only provide saliency maps with questionable faithfulness. Self-Explaining Neural Networks (SENN) extract interpretable concepts with fidelity, diversity, and grounding to combine them linearly for decision-making. While they can explain what was recognized, initial realizations lack accuracy and general applicability. We propose the Quantized-Self-Explaining Neural Network Q-SENN. Q-SENN satisfies or exceeds the desiderata of SENN while being applicable to more complex datasets and maintaining most or all of the accuracy of an uninterpretable baseline model, out-performing previous work in all considered metrics. Q-SENN describes the relationship between every class and feature as either positive, negative or neutral instead of an arbitrary number of possible relations, enforcing more binary human-friendly features. Since every class is assigned just 5 interpretable features on average, Q-SENN shows convincing local and global interpretability. Additionally, we propose a feature alignment method, capable of aligning learned features with human language-based concepts without additional supervision. Thus, what is learned can be more easily verbalized. The code is published: this https URL

Comments:	Accepted to AAAI 2024, SRRAI
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2312.13839 [cs.CV]
	(or arXiv:2312.13839v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.13839

Submission history

From: Thomas Norrenbrock [view email]
[v1] Thu, 21 Dec 2023 13:39:18 UTC (37,535 KB)
[v2] Fri, 16 Feb 2024 11:18:30 UTC (38,891 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Q-SENN: Quantized Self-Explaining Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Q-SENN: Quantized Self-Explaining Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators