Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models

Miyai, Atsuyuki; Yang, Jingkang; Zhang, Jingyang; Ming, Yifei; Yu, Qing; Irie, Go; Li, Yixuan; Li, Hai; Liu, Ziwei; Aizawa, Kiyoharu

Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.20331 (cs)

[Submitted on 29 Mar 2024]

Abstract: This paper introduces a novel and significant challenge for Vision Language Models (VLMs), termed Unsolvable Problem Detection (UPD). UPD examines a VLM's ability to withhold answers when faced with unsolvable problems in the context of Visual Question Answering (VQA) tasks. UPD encompasses three distinct settings: Absent Answer Detection (AAD), Incompatible Answer Set Detection (IASD), and Incompatible Visual Question Detection (IVQD). Extensive experiments investigating the UPD problem indicate that most VLMs, including GPT-4V and LLaVA-Next-34B, struggle with our benchmarks to varying extents, highlighting significant room for improvement. To address UPD, we explore both training-free and training-based solutions, offering new insights into their effectiveness and limitations. We hope our insights, together with future efforts within the proposed UPD settings, will enhance the broader understanding and development of more practical and reliable VLMs.
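The idea of withholding an answer can be made concrete with a small sketch. It assumes, based only on the setting's name, that an Absent Answer Detection item is a multiple-choice question whose correct option has been removed, so the only right response is to decline. The abstract does not spell out the construction, and the names `MCQItem`, `make_absent_answer_item`, `WITHHOLD`, and `is_correct` are hypothetical and not taken from the authors' code.

```python
from dataclasses import dataclass
from typing import List, Optional, Union

# Hypothetical sentinel for a model response that declines to pick an option.
WITHHOLD = "withhold"


@dataclass
class MCQItem:
    question: str
    options: List[str]
    # Index of the correct option, or None when no listed option is correct.
    answer_index: Optional[int]


def make_absent_answer_item(item: MCQItem) -> MCQItem:
    """Remove the correct option so that the item becomes unsolvable.

    Assumption: this mirrors the "absent answer" idea; the real benchmark
    may build its items differently.
    """
    if item.answer_index is None:
        raise ValueError("item has no correct option to remove")
    options = [
        opt for i, opt in enumerate(item.options) if i != item.answer_index
    ]
    return MCQItem(item.question, options, None)


def is_correct(item: MCQItem, prediction: Union[int, str]) -> bool:
    """Score a prediction: an option index, or WITHHOLD.

    On an unsolvable item only withholding counts as correct; on a
    standard item only the correct option index counts.
    """
    if item.answer_index is None:
        return prediction == WITHHOLD
    return prediction == item.answer_index


# Usage: a standard item and its unsolvable counterpart.
standard = MCQItem("What animal is shown?", ["cat", "dog", "bird"], 1)
unsolvable = make_absent_answer_item(standard)
print(is_correct(standard, 1), is_correct(unsolvable, WITHHOLD))
```

Under this scoring, a model that always picks one of the listed options can still do well on standard items but scores zero on the unsolvable ones, which is the kind of gap the abstract describes.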

Comments:	Code: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2403.20331 [cs.CV]
	(or arXiv:2403.20331v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.20331

Submission history

From: Atsuyuki Miyai
[v1] Fri, 29 Mar 2024 17:59:53 UTC (5,256 KB)
