RACQUET: Unveiling the Dangers of Overlooked Referential Ambiguity in Visual LLMs

Testoni, Alberto; Plank, Barbara; Fernández, Raquel

arXiv:2412.13835 (cs)

[Submitted on 18 Dec 2024]

Abstract: Ambiguity resolution is key to effective communication. While humans effortlessly address ambiguity through conversational grounding strategies, the extent to which current language models can emulate these strategies remains unclear. In this work, we examine referential ambiguity in image-based question answering by introducing RACQUET, a carefully curated dataset targeting distinct aspects of ambiguity. Through a series of evaluations, we reveal significant limitations and problems of overconfidence of state-of-the-art large multimodal language models in addressing ambiguity in their responses. The overconfidence issue becomes particularly relevant for RACQUET-BIAS, a subset designed to analyze a critical yet underexplored problem: failing to address ambiguity leads to stereotypical, socially biased responses. Our results underscore the urgency of equipping models with robust strategies to deal with uncertainty without resorting to undesirable stereotypes.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2412.13835 [cs.CL]
	(or arXiv:2412.13835v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.13835

Submission history

From: Alberto Testoni
[v1] Wed, 18 Dec 2024 13:25:11 UTC (3,947 KB)
