From Visual Explanations to Counterfactual Explanations with Latent Diffusion

Luu, Tung; Le, Nam; Le, Duc; Le, Bac

doi:10.1109/WACV61041.2025.00051

arXiv:2504.09202 (cs)

[Submitted on 12 Apr 2025]

Title: From Visual Explanations to Counterfactual Explanations with Latent Diffusion

Authors: Tung Luu, Nam Le, Duc Le, Bac Le

Abstract: Visual counterfactual explanations are ideal hypothetical images that change the decision-making of the classifier with high confidence toward the desired class while remaining visually plausible and close to the initial image. In this paper, we propose a new approach to tackle two key challenges in recent prominent works: i) determining which specific counterfactual features are crucial for distinguishing the "concept" of the target class from the original class, and ii) supplying valuable explanations for the non-robust classifier without relying on the support of an adversarially robust model. Our method identifies the essential region for modification through algorithms that provide visual explanations, and then our framework generates realistic counterfactual explanations by combining adversarial attacks based on pruning the adversarial gradient of the target classifier and the latent diffusion model. The proposed method outperforms previous state-of-the-art results on various evaluation criteria on ImageNet and CelebA-HQ datasets. In general, our method can be applied to arbitrary classifiers, highlight the strong association between visual and counterfactual explanations, make semantically meaningful changes from the target classifier, and provide observers with subtle counterfactual images.
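
The idea of restricting an adversarial gradient to a region chosen by a visual explanation can be illustrated with a small NumPy sketch. This is not the authors' implementation: the function names `saliency_mask` and `pruned_gradient_step`, the `keep_fraction` and `step_size` parameters, and the random stand-ins for the heatmap and gradient are all assumptions made for illustration. The sketch works in pixel space and leaves out the latent diffusion model entirely.

```python
import numpy as np


def saliency_mask(saliency, keep_fraction=0.25):
    """Binary mask that keeps the most salient pixels (hypothetical helper).

    `saliency` stands in for a heatmap from a visual explanation method;
    `keep_fraction` is an assumed hyperparameter, not taken from the paper.
    """
    threshold = np.quantile(saliency, 1.0 - keep_fraction)
    return (saliency >= threshold).astype(saliency.dtype)


def pruned_gradient_step(image, grad, mask, step_size=0.01):
    """One gradient step toward the target class, applied only inside the mask."""
    updated = image + step_size * mask * grad
    # Keep pixel values in the valid [0, 1] range.
    return np.clip(updated, 0.0, 1.0)


# Toy inputs: random arrays replace a real image, heatmap, and classifier gradient.
rng = np.random.default_rng(0)
image = rng.random((8, 8))
saliency = rng.random((8, 8))          # stand-in for an explanation heatmap
grad = rng.standard_normal((8, 8))     # stand-in for d(target logit)/d(image)

mask = saliency_mask(saliency, keep_fraction=0.25)
counterfactual = pruned_gradient_step(image, grad, mask)
changed = counterfactual != image
```

In this toy setting, pixels outside the mask are left untouched, which mirrors the goal of keeping the counterfactual close to the initial image while modifying only the region judged essential.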

Comments:	2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2504.09202 [cs.CV]
	(or arXiv:2504.09202v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2504.09202
Journal reference:	Proceedings of the Winter Conference on Applications of Computer Vision (WACV), 2025, pp. 420-429
Related DOI:	https://doi.org/10.1109/WACV61041.2025.00051

Submission history

From: Tung Luu Quy
[v1] Sat, 12 Apr 2025 13:04:00 UTC (8,517 KB)
