CI-Net: Contextual Information for Joint Semantic Segmentation and Depth Estimation

Gao, Tianxiao; Wei, Wu; Cai, Zhongbin; Fan, Zhun; Xie, Shane; Wang, Xinmei; Yu, Qiuda

Computer Science > Computer Vision and Pattern Recognition

arXiv:2107.13800 (cs)

[Submitted on 29 Jul 2021 (v1), last revised 1 Sep 2021 (this version, v2)]

Title:CI-Net: Contextual Information for Joint Semantic Segmentation and Depth Estimation

Authors:Tianxiao Gao, Wu Wei, Zhongbin Cai, Zhun Fan, Shane Xie, Xinmei Wang, Qiuda Yu

View PDF

Abstract:Monocular depth estimation and semantic segmentation are two fundamental goals of scene understanding. Due to the advantages of task interaction, many works study the joint task learning algorithm. However, most existing methods fail to fully leverage the semantic labels, ignoring the provided context structures and only using them to supervise the prediction of segmentation split, which limit the performance of both tasks. In this paper, we propose a network injected with contextual information (CI-Net) to solve the problem. Specifically, we introduce self-attention block in the encoder to generate attention map. With supervision from the ideal attention map created by semantic label, the network is embedded with contextual information so that it could understand scene better and utilize correlated features to make accurate prediction. Besides, a feature sharing module is constructed to make the task-specific features deeply fused and a consistency loss is devised to make the features mutually guided. We evaluate the proposed CI-Net on the NYU-Depth-v2 and SUN-RGBD datasets. The experimental results validate that our proposed CI-Net could effectively improve the accuracy of semantic segmentation and depth estimation.

Comments:	27 pages, 10 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2107.13800 [cs.CV]
	(or arXiv:2107.13800v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2107.13800

Submission history

From: Tianxiao Gao [view email]
[v1] Thu, 29 Jul 2021 07:58:25 UTC (863 KB)
[v2] Wed, 1 Sep 2021 09:16:06 UTC (1,406 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:CI-Net: Contextual Information for Joint Semantic Segmentation and Depth Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:CI-Net: Contextual Information for Joint Semantic Segmentation and Depth Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators