Bridging Interpretability and Robustness Using LIME-Guided Model Refinement

Nayyem, Navid; Rakin, Abdullah; Wang, Longwei

Computer Science > Machine Learning

arXiv:2412.18952 (cs)

[Submitted on 25 Dec 2024]

Title:Bridging Interpretability and Robustness Using LIME-Guided Model Refinement

Authors:Navid Nayyem, Abdullah Rakin, Longwei Wang

View PDF HTML (experimental)

Abstract:This paper explores the intricate relationship between interpretability and robustness in deep learning models. Despite their remarkable performance across various tasks, deep learning models often exhibit critical vulnerabilities, including susceptibility to adversarial attacks, over-reliance on spurious correlations, and a lack of transparency in their decision-making processes. To address these limitations, we propose a novel framework that leverages Local Interpretable Model-Agnostic Explanations (LIME) to systematically enhance model robustness. By identifying and mitigating the influence of irrelevant or misleading features, our approach iteratively refines the model, penalizing reliance on these features during training. Empirical evaluations on multiple benchmark datasets demonstrate that LIME-guided refinement not only improves interpretability but also significantly enhances resistance to adversarial perturbations and generalization to out-of-distribution data.

Comments:	10 pages, 15 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2412.18952 [cs.LG]
	(or arXiv:2412.18952v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.18952

Submission history

From: Longwei Wang [view email]
[v1] Wed, 25 Dec 2024 17:32:45 UTC (441 KB)

Computer Science > Machine Learning

Title:Bridging Interpretability and Robustness Using LIME-Guided Model Refinement

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Bridging Interpretability and Robustness Using LIME-Guided Model Refinement

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators