Is the Top Still Spinning? Evaluating Subjectivity in Narrative Understanding

Subbiah, Melanie; Mishra, Akankshya; Kim, Grace; Tang, Liyan; Durrett, Greg; McKeown, Kathleen

Computer Science > Computation and Language

arXiv:2504.01132 (cs)

[Submitted on 1 Apr 2025]

Title:Is the Top Still Spinning? Evaluating Subjectivity in Narrative Understanding

Authors:Melanie Subbiah, Akankshya Mishra, Grace Kim, Liyan Tang, Greg Durrett, Kathleen McKeown

View PDF HTML (experimental)

Abstract:Determining faithfulness of a claim to a source document is an important problem across many domains. This task is generally treated as a binary judgment of whether the claim is supported or unsupported in relation to the source. In many cases, though, whether a claim is supported can be ambiguous. For instance, it may depend on making inferences from given evidence, and different people can reasonably interpret the claim as either supported or unsupported based on their agreement with those inferences. Forcing binary labels upon such claims lowers the reliability of evaluation. In this work, we reframe the task to manage the subjectivity involved with factuality judgments of ambiguous claims. We introduce LLM-generated edits of summaries as a method of providing a nuanced evaluation of claims: how much does a summary need to be edited to be unambiguous? Whether a claim gets rewritten and how much it changes can be used as an automatic evaluation metric, the Ambiguity Rewrite Metric (ARM), with a much richer feedback signal than a binary judgment of faithfulness. We focus on the area of narrative summarization as it is particularly rife with ambiguity and subjective interpretation. We show that ARM produces a 21% absolute improvement in annotator agreement on claim faithfulness, indicating that subjectivity is reduced.

Comments:	Preprint
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2504.01132 [cs.CL]
	(or arXiv:2504.01132v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2504.01132

Submission history

From: Melanie Subbiah [view email]
[v1] Tue, 1 Apr 2025 19:08:24 UTC (4,129 KB)

Computer Science > Computation and Language

Title:Is the Top Still Spinning? Evaluating Subjectivity in Narrative Understanding

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Is the Top Still Spinning? Evaluating Subjectivity in Narrative Understanding

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators