Explaining Machine Learning Models using Entropic Variable Projection

Bachoc, François; Gamboa, Fabrice; Halford, Max; Loubes, Jean-Michel; Risser, Laurent

Statistics > Machine Learning

arXiv:1810.07924 (stat)

[Submitted on 18 Oct 2018 (v1), last revised 11 Aug 2022 (this version, v6)]

Title:Explaining Machine Learning Models using Entropic Variable Projection

Authors:François Bachoc (IMT), Fabrice Gamboa (IMT), Max Halford (IMT, IRIT), Jean-Michel Loubes (IMT), Laurent Risser (IMT)

View PDF

Abstract:In this paper, we present a new explainability formalism designed to shed light on how each input variable of a test set impacts the predictions of machine learning models. Hence, we propose a group explainability formalism for trained machine learning decision rules, based on their response to the variability of the input variables distribution. In order to emphasize the impact of each input variable, this formalism uses an information theory framework that quantifies the influence of all input-output observations based on entropic projections. This is thus the first unified and model agnostic formalism enabling data scientists to interpret the dependence between the input variables, their impact on the prediction errors, and their influence on the output predictions. Convergence rates of the entropic projections are provided in the large sample case. Most importantly, we prove that computing an explanation in our framework has a low algorithmic complexity, making it scalable to real-life large datasets. We illustrate our strategy by explaining complex decision rules learned by using XGBoost, Random Forest or Deep Neural Network classifiers on various datasets such as Adult Income, MNIST, CelebA, Boston Housing, Iris, as well as synthetic ones. We finally make clear its differences with the explainability strategies LIME and SHAP, that are based on single observations. Results can be reproduced by using the freely distributed Python toolbox this https URL.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1810.07924 [stat.ML]
	(or arXiv:1810.07924v6 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1810.07924

Submission history

From: François Bachoc [view email] [via CCSD proxy]
[v1] Thu, 18 Oct 2018 07:04:39 UTC (80 KB)
[v2] Fri, 26 Jul 2019 17:47:22 UTC (518 KB)
[v3] Tue, 4 Feb 2020 12:44:12 UTC (820 KB)
[v4] Fri, 26 Jun 2020 11:41:16 UTC (796 KB)
[v5] Wed, 2 Dec 2020 14:29:31 UTC (1,127 KB)
[v6] Thu, 11 Aug 2022 13:14:38 UTC (1,393 KB)

Statistics > Machine Learning

Title:Explaining Machine Learning Models using Entropic Variable Projection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Explaining Machine Learning Models using Entropic Variable Projection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators