Sufficient and Necessary Explanations (and What Lies in Between)

Bharti, Beepul; Yi, Paul; Sulam, Jeremias

Statistics > Machine Learning

arXiv:2409.20427 (stat)

[Submitted on 30 Sep 2024 (v1), last revised 15 Oct 2024 (this version, v2)]

Title:Sufficient and Necessary Explanations (and What Lies in Between)

Authors:Beepul Bharti, Paul Yi, Jeremias Sulam

View PDF HTML (experimental)

Abstract:As complex machine learning models continue to find applications in high-stakes decision-making scenarios, it is crucial that we can explain and understand their predictions. Post-hoc explanation methods provide useful insights by identifying important features in an input $\mathbf{x}$ with respect to the model output $f(\mathbf{x})$. In this work, we formalize and study two precise notions of feature importance for general machine learning models: sufficiency and necessity. We demonstrate how these two types of explanations, albeit intuitive and simple, can fall short in providing a complete picture of which features a model finds important. To this end, we propose a unified notion of importance that circumvents these limitations by exploring a continuum along a necessity-sufficiency axis. Our unified notion, we show, has strong ties to other popular definitions of feature importance, like those based on conditional independence and game-theoretic quantities like Shapley values. Crucially, we demonstrate how a unified perspective allows us to detect important features that could be missed by either of the previous approaches alone.

Subjects:	Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2409.20427 [stat.ML]
	(or arXiv:2409.20427v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2409.20427

Submission history

From: Beepul Bharti [view email]
[v1] Mon, 30 Sep 2024 15:50:57 UTC (7,414 KB)
[v2] Tue, 15 Oct 2024 14:04:35 UTC (12,161 KB)

Statistics > Machine Learning

Title:Sufficient and Necessary Explanations (and What Lies in Between)

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Sufficient and Necessary Explanations (and What Lies in Between)

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators