Comprehensive Attribution: Inherently Explainable Vision Model with Feature Detector

Zhang, Xianren; Lee, Dongwon; Wang, Suhang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2407.19308 (cs)

[Submitted on 27 Jul 2024 (v1), last revised 6 Aug 2024 (this version, v2)]

Title:Comprehensive Attribution: Inherently Explainable Vision Model with Feature Detector

Authors:Xianren Zhang, Dongwon Lee, Suhang Wang

View PDF HTML (experimental)

Abstract:As deep vision models' popularity rapidly increases, there is a growing emphasis on explanations for model predictions. The inherently explainable attribution method aims to enhance the understanding of model behavior by identifying the important regions in images that significantly contribute to predictions. It is achieved by cooperatively training a selector (generating an attribution map to identify important features) and a predictor (making predictions using the identified features). Despite many advancements, existing methods suffer from the incompleteness problem, where discriminative features are masked out, and the interlocking problem, where the non-optimized selector initially selects noise, causing the predictor to fit on this noise and perpetuate the cycle. To address these problems, we introduce a new objective that discourages the presence of discriminative features in the masked-out regions thus enhancing the comprehensiveness of feature selection. A pre-trained detector is introduced to detect discriminative features in the masked-out region. If the selector selects noise instead of discriminative features, the detector can observe and break the interlocking situation by penalizing the selector. Extensive experiments show that our model makes accurate predictions with higher accuracy than the regular black-box model, and produces attribution maps with high feature coverage, localization ability, fidelity and robustness. Our code will be available at \href{this https URL}{this https URL}.

Comments:	Accepted as a conference paper by ECCV 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.19308 [cs.CV]
	(or arXiv:2407.19308v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.19308

Submission history

From: Xianren Zhang [view email]
[v1] Sat, 27 Jul 2024 17:45:20 UTC (4,105 KB)
[v2] Tue, 6 Aug 2024 17:22:17 UTC (4,105 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Comprehensive Attribution: Inherently Explainable Vision Model with Feature Detector

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Comprehensive Attribution: Inherently Explainable Vision Model with Feature Detector

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators