On Explaining Your Explanations of BERT: An Empirical Study with Sequence Classification

Wu, Zhengxuan; Ong, Desmond C.

Computer Science > Computation and Language

arXiv:2101.00196 (cs)

[Submitted on 1 Jan 2021]

Title:On Explaining Your Explanations of BERT: An Empirical Study with Sequence Classification

Authors:Zhengxuan Wu, Desmond C. Ong

View PDF

Abstract:BERT, as one of the pretrianed language models, attracts the most attention in recent years for creating new benchmarks across GLUE tasks via fine-tuning. One pressing issue is to open up the blackbox and explain the decision makings of BERT. A number of attribution techniques have been proposed to explain BERT models, but are often limited to sequence to sequence tasks. In this paper, we adapt existing attribution methods on explaining decision makings of BERT in sequence classification tasks. We conduct extensive analyses of four existing attribution methods by applying them to four different datasets in sentiment analysis. We compare the reliability and robustness of each method via various ablation studies. Furthermore, we test whether attribution methods explain generalized semantics across semantically similar tasks. Our work provides solid guidance for using attribution methods to explain decision makings of BERT for downstream classification tasks.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2101.00196 [cs.CL]
	(or arXiv:2101.00196v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2101.00196

Submission history

From: Zhengxuan Wu [view email]
[v1] Fri, 1 Jan 2021 08:45:32 UTC (9,353 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2021-01

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Zhengxuan Wu
Desmond C. Ong

export BibTeX citation

Computer Science > Computation and Language

Title:On Explaining Your Explanations of BERT: An Empirical Study with Sequence Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:On Explaining Your Explanations of BERT: An Empirical Study with Sequence Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators