SimLabel: Consistency-Guided OOD Detection with Pretrained Vision-Language Models

Zou, Shu; Tian, Xinyu; Zhao, Qinyu; Yang, Zhaoyuan; Zhang, Jing

arXiv:2501.11485 (cs)

[Submitted on 20 Jan 2025]

Abstract: Detecting out-of-distribution (OOD) data is crucial in real-world machine learning applications, particularly in safety-critical domains. Existing methods often leverage language information from vision-language models (VLMs) to enhance OOD detection, improving confidence estimation through rich class-wise text information. However, when building an OOD detection score on in-distribution (ID) text-image affinity, existing works focus either on each ID class individually or on the whole ID label set, overlooking the inherent connections among ID classes. We find that the semantic information shared across different ID classes is beneficial for effective OOD detection. We thus investigate the image-text comprehension ability of VLMs across semantically related ID labels and propose a novel post-hoc strategy called SimLabel. SimLabel enhances the separability between ID and OOD samples by establishing a more robust image-class similarity metric that considers consistency over a set of similar class labels. Extensive experiments demonstrate the superior performance of SimLabel on various zero-shot OOD detection benchmarks. The proposed method also extends to various VLM backbones, demonstrating its good generalization ability. Our demonstration and implementation codes are available at: this https URL.
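The abstract describes the scoring idea only at a high level, so the following is a minimal sketch of one way a consistency-guided image-class affinity could be computed, not the authors' implementation. It assumes image and text embeddings have already been extracted from a VLM. The function names (`consistency_affinity`, `id_confidence`), the mean aggregation over similar labels, the equal weighting of the two terms, the temperature value, and the max-softmax score are all illustrative assumptions; the abstract does not specify any of them, nor how the similar labels are obtained.

```python
import numpy as np


def l2_normalize(x, axis=-1):
    """Scale vectors to unit length so dot products equal cosine similarity."""
    return x / np.linalg.norm(x, axis=axis, keepdims=True)


def consistency_affinity(image_emb, class_emb, similar_emb):
    """Per-class affinity that also looks at each class's similar labels.

    image_emb:   (D,)      embedding of one image
    class_emb:   (K, D)    text embeddings of the K ID class labels
    similar_emb: (K, M, D) text embeddings of M similar labels per class
                           (how these labels are chosen is an assumption)
    Returns an array of shape (K,).
    """
    img = l2_normalize(image_emb)
    direct = l2_normalize(class_emb) @ img                # (K,)
    neighbour = (l2_normalize(similar_emb) @ img).mean(axis=1)  # (K,)
    # Assumption: equal weighting of the direct and neighbour terms.
    return 0.5 * (direct + neighbour)


def id_confidence(affinity, temperature=0.1):
    """Max-softmax over class affinities; higher suggests in-distribution."""
    z = affinity / temperature
    z = z - z.max()                 # numerical stability
    probs = np.exp(z) / np.exp(z).sum()
    return float(probs.max())


# Toy example with hand-written 3-D "embeddings" (not real VLM outputs).
class_emb = np.array([[1.0, 0.0, 0.0],
                      [0.0, 1.0, 0.0]])
similar_emb = np.array([[[1.0, 0.1, 0.0], [1.0, 0.0, 0.1]],
                        [[0.1, 1.0, 0.0], [0.0, 1.0, 0.1]]])
id_image = np.array([1.0, 0.0, 0.0])    # aligned with class 0
ood_image = np.array([0.0, 0.0, 1.0])   # aligned with neither class

id_score = id_confidence(consistency_affinity(id_image, class_emb, similar_emb))
ood_score = id_confidence(consistency_affinity(ood_image, class_emb, similar_emb))
print(id_score > ood_score)
```

In this toy setup, an image that agrees with a class label and with that class's similar labels receives a peaked confidence, while an image that is equally unrelated to every class receives a flat one. A threshold on the score would then separate the two cases; choosing that threshold is outside the scope of this sketch.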

Comments:	10 pages
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2501.11485 [cs.CV]
	(or arXiv:2501.11485v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.11485

Submission history

From: Jing Zhang
[v1] Mon, 20 Jan 2025 13:36:30 UTC (4,340 KB)
