Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images

Wang, Xiaoqiang; Zhu, Lei; Tang, Siliang; Fu, Huazhu; Li, Ping; Wu, Fei; Yang, Yi; Zhuang, Yueting

doi:10.1109/TIP.2021.3139232

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2201.00100 (eess)

[Submitted on 1 Jan 2022]

Title:Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images

Authors:Xiaoqiang Wang, Lei Zhu, Siliang Tang, Huazhu Fu, Ping Li, Fei Wu, Yi Yang, Yueting Zhuang

View PDF

Abstract:Training deep models for RGB-D salient object detection (SOD) often requires a large number of labeled RGB-D images. However, RGB-D data is not easily acquired, which limits the development of RGB-D SOD techniques. To alleviate this issue, we present a Dual-Semi RGB-D Salient Object Detection Network (DS-Net) to leverage unlabeled RGB images for boosting RGB-D saliency detection. We first devise a depth decoupling convolutional neural network (DDCNN), which contains a depth estimation branch and a saliency detection branch. The depth estimation branch is trained with RGB-D images and then used to estimate the pseudo depth maps for all unlabeled RGB images to form the paired data. The saliency detection branch is used to fuse the RGB feature and depth feature to predict the RGB-D saliency. Then, the whole DDCNN is assigned as the backbone in a teacher-student framework for semi-supervised learning. Moreover, we also introduce a consistency loss on the intermediate attention and saliency maps for the unlabeled data, as well as a supervised depth and saliency loss for labeled data. Experimental results on seven widely-used benchmark datasets demonstrate that our DDCNN outperforms state-of-the-art methods both quantitatively and qualitatively. We also demonstrate that our semi-supervised DS-Net can further improve the performance, even when using an RGB image with the pseudo depth map.

Comments:	Accepted by IEEE TIP
Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2201.00100 [eess.IV]
	(or arXiv:2201.00100v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2201.00100
Related DOI:	https://doi.org/10.1109/TIP.2021.3139232

Submission history

From: Xiaoqiang Wang [view email]
[v1] Sat, 1 Jan 2022 03:02:27 UTC (25,683 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Boosting RGB-D Saliency Detection by Leveraging Unlabeled RGB Images

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators