A Holistically Point-guided Text Framework for Weakly-Supervised Camouflaged Object Detection

Mok, Tsui Qin; Gao, Shuyong; Xing, Haozhe; He, Miaoyang; Wang, Yan; Zhang, Wenqiang

Computer Science > Computer Vision and Pattern Recognition

arXiv:2501.06038 (cs)

[Submitted on 10 Jan 2025]

Title:A Holistically Point-guided Text Framework for Weakly-Supervised Camouflaged Object Detection

Authors:Tsui Qin Mok, Shuyong Gao, Haozhe Xing, Miaoyang He, Yan Wang, Wenqiang Zhang

View PDF HTML (experimental)

Abstract:Weakly-Supervised Camouflaged Object Detection (WSCOD) has gained popularity for its promise to train models with weak labels to segment objects that visually blend into their surroundings. Recently, some methods using sparsely-annotated supervision shown promising results through scribbling in WSCOD, while point-text supervision remains underexplored. Hence, this paper introduces a novel holistically point-guided text framework for WSCOD by decomposing into three phases: segment, choose, train. Specifically, we propose Point-guided Candidate Generation (PCG), where the point's foreground serves as a correction for the text path to explicitly correct and rejuvenate the loss detection object during the mask generation process (SEGMENT). We also introduce a Qualified Candidate Discriminator (QCD) to choose the optimal mask from a given text prompt using CLIP (CHOOSE), and employ the chosen pseudo mask for training with a self-supervised Vision Transformer (TRAIN). Additionally, we developed a new point-supervised dataset (P2C-COD) and a text-supervised dataset (T-COD). Comprehensive experiments on four benchmark datasets demonstrate our method outperforms state-of-the-art methods by a large margin, and also outperforms some existing fully-supervised camouflaged object detection methods.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2501.06038 [cs.CV]
	(or arXiv:2501.06038v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.06038

Submission history

From: Shuyong Gao [view email]
[v1] Fri, 10 Jan 2025 15:17:02 UTC (9,570 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:A Holistically Point-guided Text Framework for Weakly-Supervised Camouflaged Object Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:A Holistically Point-guided Text Framework for Weakly-Supervised Camouflaged Object Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators