Towards Counterfactual and Contrastive Explainability and Transparency of DCNN Image Classifiers

Tariq, Syed Ali; Zia, Tehseen; Ghafoor, Mubeen

doi:10.1016/j.knosys.2022.109901

Computer Science > Computer Vision and Pattern Recognition

arXiv:2501.06831 (cs)

[Submitted on 12 Jan 2025]

Title:Towards Counterfactual and Contrastive Explainability and Transparency of DCNN Image Classifiers

Authors:Syed Ali Tariq, Tehseen Zia, Mubeen Ghafoor

View PDF HTML (experimental)

Abstract:Explainability of deep convolutional neural networks (DCNNs) is an important research topic that tries to uncover the reasons behind a DCNN model's decisions and improve their understanding and reliability in high-risk environments. In this regard, we propose a novel method for generating interpretable counterfactual and contrastive explanations for DCNN models. The proposed method is model intrusive that probes the internal workings of a DCNN instead of altering the input image to generate explanations. Given an input image, we provide contrastive explanations by identifying the most important filters in the DCNN representing features and concepts that separate the model's decision between classifying the image to the original inferred class or some other specified alter class. On the other hand, we provide counterfactual explanations by specifying the minimal changes necessary in such filters so that a contrastive output is obtained.
Using these identified filters and concepts, our method can provide contrastive and counterfactual reasons behind a model's decisions and makes the model more transparent. One of the interesting applications of this method is misclassification analysis, where we compare the identified concepts from a particular input image and compare them with class-specific concepts to establish the validity of the model's decisions. The proposed method is compared with state-of-the-art and evaluated on the Caltech-UCSD Birds (CUB) 2011 dataset to show the usefulness of the explanations provided.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2501.06831 [cs.CV]
	(or arXiv:2501.06831v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.06831
Journal reference:	Knowledge-Based Systems, Volume 257, 2022, 109901, ISSN 0950-7051
Related DOI:	https://doi.org/10.1016/j.knosys.2022.109901

Submission history

From: Syed Ali Tariq [view email]
[v1] Sun, 12 Jan 2025 14:54:02 UTC (1,558 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Counterfactual and Contrastive Explainability and Transparency of DCNN Image Classifiers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Counterfactual and Contrastive Explainability and Transparency of DCNN Image Classifiers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators