A Holistic Approach to Unifying Automatic Concept Extraction and Concept Importance Estimation

Fel, Thomas; Boutin, Victor; Moayeri, Mazda; Cadène, Rémi; Bethune, Louis; Andéol, Léo; Chalvidal, Mathieu; Serre, Thomas

Computer Science > Machine Learning

arXiv:2306.07304 (cs)

[Submitted on 11 Jun 2023 (v1), last revised 29 Oct 2023 (this version, v2)]

Title: A Holistic Approach to Unifying Automatic Concept Extraction and Concept Importance Estimation

Authors: Thomas Fel, Victor Boutin, Mazda Moayeri, Rémi Cadène, Louis Bethune, Léo Andéol, Mathieu Chalvidal, Thomas Serre

Abstract: In recent years, concept-based approaches have emerged as some of the most promising explainability methods to help us interpret the decisions of Artificial Neural Networks (ANNs). These methods seek to discover intelligible visual 'concepts' buried within the complex patterns of ANN activations in two key steps: (1) concept extraction followed by (2) importance estimation. While these two steps are shared across methods, they all differ in their specific implementations. Here, we introduce a unifying theoretical framework that comprehensively defines and clarifies these two steps. This framework offers several advantages as it allows us: (i) to propose new evaluation metrics for comparing different concept extraction approaches; (ii) to leverage modern attribution methods and evaluation metrics to extend and systematically evaluate state-of-the-art concept-based approaches and importance estimation techniques; (iii) to derive theoretical guarantees regarding the optimality of such methods. We further leverage our framework to try to tackle a crucial question in explainability: how to efficiently identify clusters of data points that are classified based on a similar shared strategy. To illustrate these findings and to highlight the main strategies of a model, we introduce a visual representation called the strategic cluster graph. Finally, we present this https URL, a dedicated website that offers a complete compilation of these visualizations for all classes of the ImageNet dataset.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2306.07304 [cs.LG]
	(or arXiv:2306.07304v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.07304
Journal reference:	Conference on Neural Information Processing Systems (NeurIPS), 2023

Submission history

From: Thomas Fel
[v1] Sun, 11 Jun 2023 23:28:02 UTC (3,501 KB)
[v2] Sun, 29 Oct 2023 22:28:21 UTC (4,694 KB)
