Unsupervised Learning of Depth and Deep Representation for Visual Odometry from Monocular Videos in a Metric Space

Yin, Xiaochuan; Liu, Chengju


arXiv:1908.01367 (cs)

[Submitted on 4 Aug 2019]


Abstract: For ego-motion estimation, the feature representation of the scene is crucial. Previous work indicates that both low-level and semantic feature-based methods can achieve promising results, so a hierarchical feature representation may combine the benefits of both. From this perspective, we propose a novel direct feature odometry framework, named DFO, for depth estimation and hierarchical feature representation learning from monocular videos. By exploiting the metric distance, our framework is able to learn the hierarchical feature representation without supervision. The pose is obtained with a coarse-to-fine approach, from high-level to low-level features in enlarged feature maps. A pixel-level attention mask can be self-learned to provide prior information. In contrast to previous methods, our proposed method calculates the camera motion with a direct method rather than regressing the ego-motion from a pose network. With this approach, the consistency of the scale factor of translation can be constrained, and the proposed method is compatible with the traditional SLAM pipeline. Experiments on the KITTI dataset demonstrate the effectiveness of our method.
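
To make the idea of a direct method concrete, the following is a minimal sketch of the residual such a method evaluates: pixels are back-projected with their depth, moved by a candidate pose, re-projected into the second frame, and compared in feature space. It is not the DFO implementation. The function names, the pinhole intrinsics tuple, the single-channel feature maps, and the nearest-neighbour sampling are all assumptions made for illustration, and the optimisation over the pose (and any coarse-to-fine schedule or attention mask) is left out.

```python
def warp_pixel(u, v, depth, intr, R, t):
    """Map pixel (u, v) with metric depth from a source frame into a target frame.

    intr is an assumed pinhole tuple (fx, fy, cx, cy); R is a 3x3 rotation
    (nested lists) and t a 3-vector, both taking source coordinates to target.
    """
    fx, fy, cx, cy = intr
    # Back-project the pixel to a 3D point in the source camera frame.
    p = ((u - cx) / fx * depth, (v - cy) / fy * depth, depth)
    # Apply the candidate rigid motion.
    xm, ym, zm = (sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3))
    # Project back to pixel coordinates in the target frame.
    return fx * xm / zm + cx, fy * ym / zm + cy


def feature_residual(feat_src, feat_tgt, pixels, depths, intr, R, t):
    """Mean absolute feature difference under a candidate pose (hypothetical helper).

    feat_src and feat_tgt are 2D lists indexed [row][col]; pixels holds integer
    (u, v) pairs in the source frame. Warped pixels that leave the image are skipped.
    """
    h, w = len(feat_tgt), len(feat_tgt[0])
    total, count = 0.0, 0
    for (u, v), d in zip(pixels, depths):
        uw, vw = warp_pixel(u, v, d, intr, R, t)
        col, row = round(uw), round(vw)  # nearest-neighbour lookup
        if 0 <= row < h and 0 <= col < w:
            total += abs(feat_src[v][u] - feat_tgt[row][col])
            count += 1
    return total / count if count else 0.0
```

A direct method would search for the pose that minimises this residual. Because the depth in the sketch is metric, the translation `t` is expressed in the same units, which is one way to read the abstract's remark about constraining the scale of translation.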

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
Cite as:	arXiv:1908.01367 [cs.CV]
	(or arXiv:1908.01367v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1908.01367

Submission history

From: Xiaochuan Yin
[v1] Sun, 4 Aug 2019 15:48:31 UTC (5,642 KB)
