Visual Prompt Selection for In-Context Learning Segmentation

Suo, Wei; Lai, Lanqing; Sun, Mengyang; Zhang, Hanwang; Wang, Peng; Zhang, Yanning

arXiv:2407.10233 (cs)

[Submitted on 14 Jul 2024]

Abstract: As a fundamental and extensively studied task in computer vision, image segmentation aims to locate and identify different semantic concepts at the pixel level. Recently, inspired by In-Context Learning (ICL), several generalist segmentation frameworks have been proposed, providing a promising paradigm for segmenting specific objects. However, existing works mostly ignore the value of visual prompts or simply apply similarity sorting to select contextual examples. In this paper, we focus on rethinking and improving the example selection strategy. Through comprehensive comparisons, we first demonstrate that ICL-based segmentation models are sensitive to different contexts. Furthermore, empirical evidence indicates that the diversity of contextual prompts plays a crucial role in guiding segmentation. Based on these insights, we propose a new stepwise context search method. Unlike previous works, we construct a small yet rich candidate pool and adaptively search for well-matched contexts. More importantly, this method effectively reduces the annotation cost by compacting the search space. Extensive experiments show that our method is an effective strategy for selecting examples and enhancing segmentation performance.
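
To make the contrast in the abstract concrete, the sketch below compares the similarity-sorting baseline with one simple way of building a small, diverse candidate pool. It is an illustration only and is not the stepwise context search method proposed in the paper: the greedy farthest-point rule, the function names (`cosine`, `top_k_similar`, `diverse_pool`), and the toy 2-D feature vectors are all assumptions made here. In practice the vectors would come from some image encoder, which the abstract does not specify.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def top_k_similar(query, candidates, k):
    """Baseline: indices of the k candidates most similar to the query."""
    order = sorted(
        range(len(candidates)),
        key=lambda i: cosine(query, candidates[i]),
        reverse=True,
    )
    return order[:k]


def diverse_pool(candidates, pool_size):
    """Hypothetical pool builder: greedy farthest-point selection.

    Starts from candidate 0 and repeatedly adds the candidate that is
    least similar to everything already in the pool.
    """
    if not candidates or pool_size <= 0:
        return []
    pool = [0]
    while len(pool) < min(pool_size, len(candidates)):
        best, best_dist = None, -1.0
        for i in range(len(candidates)):
            if i in pool:
                continue
            # Distance to the nearest pool member (1 - max cosine similarity).
            dist = 1.0 - max(cosine(candidates[i], candidates[j]) for j in pool)
            if dist > best_dist:
                best, best_dist = i, dist
        pool.append(best)
    return pool


# Toy features: two near-duplicate clusters.
candidates = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
query = [1.0, 0.05]

similar = top_k_similar(query, candidates, 2)  # picks two near-duplicates
pool = diverse_pool(candidates, 2)             # picks one from each cluster
print(similar, pool)
```

On this toy data the baseline returns two nearly identical examples, while the pool covers both clusters; a query-specific search could then be run over the small pool instead of the full annotated set.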

Comments:	Accepted by ECCV 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2407.10233 [cs.CV]
	(or arXiv:2407.10233v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.10233

Submission history

From: Wei Suo
[v1] Sun, 14 Jul 2024 15:02:54 UTC (4,841 KB)
