SEAL : Interactive Tool for Systematic Error Analysis and Labeling

Rajani, Nazneen; Liang, Weixin; Chen, Lingjiao; Mitchell, Meg; Zou, James

Computer Science > Computation and Language

arXiv:2210.05839 (cs)

[Submitted on 11 Oct 2022]

Title:SEAL : Interactive Tool for Systematic Error Analysis and Labeling

Authors:Nazneen Rajani, Weixin Liang, Lingjiao Chen, Meg Mitchell, James Zou

View PDF

Abstract:With the advent of Transformers, large language models (LLMs) have saturated well-known NLP benchmarks and leaderboards with high aggregate performance. However, many times these models systematically fail on tail data or rare groups not obvious in aggregate evaluation. Identifying such problematic data groups is even more challenging when there are no explicit labels (e.g., ethnicity, gender, etc.) and further compounded for NLP datasets due to the lack of visual features to characterize failure modes (e.g., Asian males, animals indoors, waterbirds on land, etc.). This paper introduces an interactive Systematic Error Analysis and Labeling (\seal) tool that uses a two-step approach to first identify high error slices of data and then, in the second step, introduce methods to give human-understandable semantics to those underperforming slices. We explore a variety of methods for coming up with coherent semantics for the error groups using language models for semantic labeling and a text-to-image model for generating visual features. SEAL toolkit and demo screencast is available at this https URL.

Comments:	Accepted at EMNLP 2022 demo track
Subjects:	Computation and Language (cs.CL); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2210.05839 [cs.CL]
	(or arXiv:2210.05839v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.05839

Submission history

From: Nazneen Fatema Rajani [view email]
[v1] Tue, 11 Oct 2022 23:51:44 UTC (5,044 KB)

Computer Science > Computation and Language

Title:SEAL : Interactive Tool for Systematic Error Analysis and Labeling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:SEAL : Interactive Tool for Systematic Error Analysis and Labeling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators