INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation

Hu, Jian; Cheng, Zixu; Gong, Shaogang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2501.18753 (cs)

[Submitted on 30 Jan 2025]

Title:INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation

Authors:Jian Hu, Zixu Cheng, Shaogang Gong

View PDF HTML (experimental)

Abstract:Task-generic promptable image segmentation aims to achieve segmentation of diverse samples under a single task description by utilizing only one task-generic prompt. Current methods leverage the generalization capabilities of Vision-Language Models (VLMs) to infer instance-specific prompts from these task-generic prompts in order to guide the segmentation process. However, when VLMs struggle to generalise to some image instances, predicting instance-specific prompts becomes poor. To solve this problem, we introduce \textbf{I}nstance-specific \textbf{N}egative Mining for \textbf{T}ask-Generic Promptable Segmentation (\textbf{INT}). The key idea of INT is to adaptively reduce the influence of irrelevant (negative) prior knowledge whilst to increase the use the most plausible prior knowledge, selected by negative mining with higher contrast, in order to optimise instance-specific prompts generation. Specifically, INT consists of two components: (1) instance-specific prompt generation, which progressively fliters out incorrect information in prompt generation; (2) semantic mask generation, which ensures each image instance segmentation matches correctly the semantics of the instance-specific prompts. INT is validated on six datasets, including camouflaged objects and medical images, demonstrating its effectiveness, robustness and scalability.

Comments:	A new task-generic promptable segmentation approach
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2501.18753 [cs.CV]
	(or arXiv:2501.18753v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.18753

Submission history

From: Jian Hu [view email]
[v1] Thu, 30 Jan 2025 21:07:14 UTC (7,381 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:INT: Instance-Specific Negative Mining for Task-Generic Promptable Segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators