ConSim: Measuring Concept-Based Explanations' Effectiveness with Automated Simulatability

Poché, Antonin; Jacovi, Alon; Picard, Agustin Martin; Boutin, Victor; Jourdan, Fanny

arXiv:2501.05855 (cs)

[Submitted on 10 Jan 2025 (v1), last revised 13 Jan 2025 (this version, v2)]

Authors: Antonin Poché (IRIT), Alon Jacovi, Agustin Martin Picard, Victor Boutin (CERCO UMR5549, ANITI), Fanny Jourdan

Abstract: Concept-based explanations work by mapping complex model computations to human-understandable concepts. Evaluating such explanations is very difficult, as it includes not only the quality of the induced space of possible concepts but also how effectively the chosen concepts are communicated to users. Existing evaluation metrics often focus solely on the former, neglecting the latter. We introduce an evaluation framework for measuring concept explanations via automated simulatability: a simulator's ability to predict the explained model's outputs based on the provided explanations. This approach accounts for both the concept space and its interpretation in an end-to-end evaluation. Human studies for simulatability are notoriously difficult to enact, particularly at the scale of a wide, comprehensive empirical evaluation (which is the subject of this work). We propose using large language models (LLMs) as simulators to approximate the evaluation and report various analyses to make such approximations reliable. Our method allows for scalable and consistent evaluation across various models and datasets. We report a comprehensive empirical evaluation using this framework and show that LLMs provide consistent rankings of explanation methods. Code available at this https URL.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2501.05855 [cs.CL]
	(or arXiv:2501.05855v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2501.05855

Submission history

From: Antonin Poche [via CCSD proxy]
[v1] Fri, 10 Jan 2025 10:53:48 UTC (2,858 KB)
[v2] Mon, 13 Jan 2025 10:39:54 UTC (2,858 KB)
