GEDepth: Ground Embedding for Monocular Depth Estimation

Yang, Xiaodong; Ma, Zhuang; Ji, Zhiyu; Ren, Zhe

arXiv:2309.09975 (cs)

[Submitted on 18 Sep 2023]

Abstract: Monocular depth estimation is an ill-posed problem because the same 2D image can be the projection of infinitely many 3D scenes. Although the leading algorithms in this field have reported significant improvement, they are essentially geared to a particular combination of pictorial observations and camera parameters (i.e., intrinsics and extrinsics), which strongly limits their generalizability in real-world scenarios. To cope with this challenge, this paper proposes a novel ground embedding module that decouples camera parameters from pictorial cues, thus promoting generalization. Given the camera parameters, the proposed module generates the ground depth, which is stacked with the input image and referenced in the final depth prediction. A ground attention is designed in the module to optimally combine the ground depth with the residual depth. Our ground embedding is highly flexible and lightweight, resulting in a plug-in module that can be readily integrated into various depth estimation networks. Experiments show that our approach achieves state-of-the-art results on popular benchmarks and, more importantly, yields significant generalization improvement on a wide range of cross-domain tests.
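To make the idea of "generating the ground depth from camera parameters" concrete, here is a minimal sketch based on standard pinhole geometry. It is not taken from the paper: it assumes a perfectly flat ground plane, zero camera roll, and a known camera height and downward pitch. The function name `ground_depth_map` and all of its parameters are hypothetical and chosen only for illustration.

```python
import math


def ground_depth_map(height, width, fy, cy, cam_height, pitch=0.0,
                     max_depth=float("inf")):
    """Depth (along the optical axis) of a flat ground plane at every pixel.

    Illustrative assumptions, not the paper's formulation:
      - pinhole camera, image y axis pointing down, zero roll
      - cam_height: camera height above the ground, in metres
      - pitch: downward tilt of the optical axis, in radians
    Pixels whose viewing ray never reaches the ground (at or above the
    horizon) are assigned max_depth.
    """
    cos_p, sin_p = math.cos(pitch), math.sin(pitch)
    depth = []
    for v in range(height):
        # Normalised vertical image coordinate of row v.
        yn = (v - cy) / fy
        # A ray (xn, yn, 1) * z meets the ground where
        # z * (yn * cos(pitch) + sin(pitch)) == cam_height.
        denom = yn * cos_p + sin_p
        if denom > 1e-9:
            z = min(cam_height / denom, max_depth)
        else:
            z = max_depth
        # With zero roll the ground depth does not depend on the column.
        depth.append([z] * width)
    return depth


if __name__ == "__main__":
    # Toy camera: 100x4 image, focal length 100 px, camera 1.5 m above ground.
    g = ground_depth_map(100, 4, fy=100.0, cy=50.0, cam_height=1.5)
    print(g[80][0])  # a row below the horizon has finite ground depth
    print(g[20][0])  # a row above the horizon falls back to max_depth
```

Because the map depends only on the intrinsics (`fy`, `cy`) and extrinsics (`cam_height`, `pitch`), it can be recomputed for any camera setup without touching the image content, which is the kind of decoupling the abstract describes. How the paper stacks this map with the image and blends it with the residual depth through ground attention is not specified in the abstract, so it is not sketched here.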

Comments:	ICCV 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2309.09975 [cs.CV]
	(or arXiv:2309.09975v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.09975

Submission history

From: Xiaodong Yang
[v1] Mon, 18 Sep 2023 17:56:06 UTC (10,856 KB)
