Understanding Multi-Granularity for Open-Vocabulary Part Segmentation

Choi, Jiho; Lee, Seonho; Lee, Seungho; Lee, Minhyun; Shim, Hyunjung

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.11384 (cs)

[Submitted on 17 Jun 2024 (v1), last revised 2 Nov 2024 (this version, v2)]

Title:Understanding Multi-Granularity for Open-Vocabulary Part Segmentation

Authors:Jiho Choi, Seonho Lee, Seungho Lee, Minhyun Lee, Hyunjung Shim

View PDF HTML (experimental)

Abstract:Open-vocabulary part segmentation (OVPS) is an emerging research area focused on segmenting fine-grained entities using diverse and previously unseen vocabularies. Our study highlights the inherent complexities of part segmentation due to intricate boundaries and diverse granularity, reflecting the knowledge-based nature of part identification. To address these challenges, we propose PartCLIPSeg, a novel framework utilizing generalized parts and object-level contexts to mitigate the lack of generalization in fine-grained parts. PartCLIPSeg integrates competitive part relationships and attention control, alleviating ambiguous boundaries and underrepresented parts. Experimental results demonstrate that PartCLIPSeg outperforms existing state-of-the-art OVPS methods, offering refined segmentation and an advanced understanding of part relationships within images. Through extensive experiments, our model demonstrated a significant improvement over the state-of-the-art models on the Pascal-Part-116, ADE20K-Part-234, and PartImageNet datasets.

Comments:	NeurIPS 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.11384 [cs.CV]
	(or arXiv:2406.11384v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.11384

Submission history

From: Jiho Choi [view email]
[v1] Mon, 17 Jun 2024 10:11:28 UTC (14,436 KB)
[v2] Sat, 2 Nov 2024 11:22:40 UTC (56,356 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Understanding Multi-Granularity for Open-Vocabulary Part Segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Understanding Multi-Granularity for Open-Vocabulary Part Segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators