GLPanoDepth: Global-to-Local Panoramic Depth Estimation

Bai, Jiayang; Lai, Shuichang; Qin, Haoyu; Guo, Jie; Guo, Yanwen

Computer Science > Computer Vision and Pattern Recognition

arXiv:2202.02796 (cs)

[Submitted on 6 Feb 2022 (v1), last revised 8 Feb 2022 (this version, v2)]

Title:GLPanoDepth: Global-to-Local Panoramic Depth Estimation

Authors:Jiayang Bai, Shuichang Lai, Haoyu Qin, Jie Guo, Yanwen Guo

View PDF

Abstract:In this paper, we propose a learning-based method for predicting dense depth values of a scene from a monocular omnidirectional image. An omnidirectional image has a full field-of-view, providing much more complete descriptions of the scene than perspective images. However, fully-convolutional networks that most current solutions rely on fail to capture rich global contexts from the panorama. To address this issue and also the distortion of equirectangular projection in the panorama, we propose Cubemap Vision Transformers (CViT), a new transformer-based architecture that can model long-range dependencies and extract distortion-free global features from the panorama. We show that cubemap vision transformers have a global receptive field at every stage and can provide globally coherent predictions for spherical signals. To preserve important local features, we further design a convolution-based branch in our pipeline (dubbed GLPanoDepth) and fuse global features from cubemap vision transformers at multiple scales. This global-to-local strategy allows us to fully exploit useful global and local features in the panorama, achieving state-of-the-art performance in panoramic depth estimation.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2202.02796 [cs.CV]
	(or arXiv:2202.02796v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2202.02796

Submission history

From: Jiayang Bai [view email]
[v1] Sun, 6 Feb 2022 15:11:58 UTC (7,948 KB)
[v2] Tue, 8 Feb 2022 11:51:33 UTC (7,948 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:GLPanoDepth: Global-to-Local Panoramic Depth Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:GLPanoDepth: Global-to-Local Panoramic Depth Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators