Sentence-Based Model Agnostic NLP Interpretability

Rychener, Yves; Renard, Xavier; Seddah, Djamé; Frossard, Pascal; Detyniecki, Marcin

Computer Science > Computation and Language

arXiv:2012.13189v1 (cs)

[Submitted on 24 Dec 2020 (this version), latest version 8 Aug 2022 (v3)]

Title:Sentence-Based Model Agnostic NLP Interpretability

Authors:Yves Rychener, Xavier Renard, Djamé Seddah, Pascal Frossard, Marcin Detyniecki

View PDF

Abstract:Today, interpretability of Black-Box Natural Language Processing (NLP) models based on surrogates, like LIME or SHAP, uses word-based sampling to build the explanations. In this paper we explore the use of sentences to tackle NLP interpretability. While this choice may seem straight forward, we show that, when using complex classifiers like BERT, the word-based approach raises issues not only of computational complexity, but also of an out of distribution sampling, eventually leading to non founded explanations. By using sentences, the altered text remains in-distribution and the dimensionality of the problem is reduced for better fidelity to the black-box at comparable computational complexity.

Subjects:	Computation and Language (cs.CL); Machine Learning (stat.ML)
Cite as:	arXiv:2012.13189 [cs.CL]
	(or arXiv:2012.13189v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2012.13189

Submission history

From: Yves Rychener [view email]
[v1] Thu, 24 Dec 2020 10:32:41 UTC (8,752 KB)
[v2] Sun, 27 Dec 2020 17:54:38 UTC (8,752 KB)
[v3] Mon, 8 Aug 2022 11:04:42 UTC (6,887 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2020-12

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Xavier Renard
Pascal Frossard
Marcin Detyniecki

export BibTeX citation

Computer Science > Computation and Language

Title:Sentence-Based Model Agnostic NLP Interpretability

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Sentence-Based Model Agnostic NLP Interpretability

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators