On Model Explanations with Transferable Neural Pathways

Lin, Xinmiao; Bao, Wentao; Yu, Qi; Kong, Yu


arXiv:2309.09887 (cs)

[Submitted on 18 Sep 2023]


Abstract: Neural pathways as model explanations consist of a sparse set of neurons that provide the same level of prediction performance as the whole model. Existing methods primarily focus on accuracy and sparsity, but the generated pathways may offer limited interpretability and thus fall short in explaining the model's behavior. In this paper, we propose two interpretability criteria for neural pathways: (i) same-class neural pathways should primarily consist of class-relevant neurons; (ii) each instance's neural pathway sparsity should be optimally determined. To this end, we propose a Generative Class-relevant Neural Pathway (GEN-CNP) model that learns to predict the neural pathways from the target model's feature maps. We propose to learn class-relevant information from features of deep and shallow layers such that same-class neural pathways exhibit high similarity. We further impose a faithfulness criterion for GEN-CNP to generate pathways with instance-specific sparsity. We propose to transfer the class-relevant neural pathways to explain samples of the same class, and we show experimentally and qualitatively their faithfulness and interpretability.
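To make the abstract's terms concrete, the following is a minimal sketch of how a neural pathway can be represented as a binary mask over neurons, with a per-instance sparsity and a similarity score between pathways. It is an illustration only, not the GEN-CNP method: the function names (`pathway_mask`, `sparsity`, `jaccard`), the top-k selection rule, the Jaccard similarity measure, and the importance scores are all assumptions made for this example.

```python
# Illustrative sketch only: NOT the GEN-CNP method from the paper.
# All names, the top-k rule, and the scores below are hypothetical.

def pathway_mask(scores, k):
    """Return a binary mask that keeps the k neurons with the highest scores."""
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    keep = set(top)
    return [1 if i in keep else 0 for i in range(len(scores))]

def sparsity(mask):
    """Fraction of neurons that are NOT part of the pathway."""
    return 1.0 - sum(mask) / len(mask)

def jaccard(a, b):
    """Overlap between two pathways: |A and B| / |A or B|."""
    inter = sum(x & y for x, y in zip(a, b))
    union = sum(x | y for x, y in zip(a, b))
    return inter / union if union else 1.0

# Made-up importance scores for six neurons.
scores_a = [0.9, 0.1, 0.8, 0.2, 0.7, 0.05]   # instance of class 0
scores_b = [0.85, 0.3, 0.6, 0.1, 0.2, 0.0]   # another instance of class 0
scores_c = [0.1, 0.9, 0.05, 0.8, 0.2, 0.7]   # instance of class 1

# Instance-specific sparsity: each instance keeps a different number of neurons.
mask_a = pathway_mask(scores_a, k=3)
mask_b = pathway_mask(scores_b, k=2)
mask_c = pathway_mask(scores_c, k=3)

same_class_similarity = jaccard(mask_a, mask_b)
cross_class_similarity = jaccard(mask_a, mask_c)
```

In this toy setting, the two same-class masks share most of their neurons while the cross-class pair shares none, which is the kind of behavior criterion (i) asks for; the differing `k` values stand in for the instance-specific sparsity of criterion (ii).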

Comments:	Arxiv preprint
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2309.09887 [cs.CV]
	(or arXiv:2309.09887v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.09887

Submission history

From: Xinmiao Lin
[v1] Mon, 18 Sep 2023 15:50:38 UTC (32,531 KB)
