Time-to-Label: Temporal Consistency for Self-Supervised Monocular 3D Object Detection

Mouawad, Issa; Brasch, Nikolas; Manhardt, Fabian; Tombari, Federico; Odone, Francesca

doi:10.1109/LRA.2022.3188882


arXiv:2203.02193 (cs)

[Submitted on 4 Mar 2022]



Abstract: Monocular 3D object detection continues to attract attention due to the cost benefits and wider availability of RGB cameras. Despite recent advances and the ability to acquire data at scale, annotation cost and complexity still limit the size of 3D object detection datasets in the supervised setting. Self-supervised methods, on the other hand, aim to train deep networks using pretext tasks or various consistency constraints. Moreover, other 3D perception tasks (such as depth estimation) have shown the benefits of temporal priors as a self-supervision signal. In this work, we argue that temporal consistency at the level of object poses provides an important supervision signal, given the strong prior on physical motion. Specifically, we propose a self-supervised loss that uses this consistency, in addition to render-and-compare losses, to refine noisy pose predictions and derive high-quality pseudo-labels. To assess the effectiveness of the proposed method, we fine-tune a synthetically trained monocular 3D object detection model using the pseudo-labels generated on real data. Evaluation on the standard KITTI3D benchmark demonstrates that our method reaches competitive performance compared to other monocular self-supervised and supervised methods.
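As a rough illustration of the kind of temporal prior on object poses that the abstract refers to, the sketch below penalizes deviation from constant-velocity motion along a track of per-frame 3D object centers. This is not the loss defined in the paper, whose exact form is not given in the abstract. The function name `temporal_consistency_loss`, the constant-velocity assumption, and the sample tracks are all hypothetical, chosen only to make the idea concrete.

```python
from typing import List, Sequence

Vec3 = Sequence[float]


def temporal_consistency_loss(centers: List[Vec3]) -> float:
    """Mean squared second-order difference of a track of 3D object centers.

    Assumption (not from the paper): physically plausible motion is close to
    constant velocity over short windows, so the discrete acceleration
    p[t+1] - 2*p[t] + p[t-1] should be near zero for a consistent track.
    """
    if len(centers) < 3:
        # Fewer than three frames give no second-order difference to penalize.
        return 0.0
    total = 0.0
    for prev, cur, nxt in zip(centers, centers[1:], centers[2:]):
        # Sum the squared discrete acceleration over the x, y, z components.
        total += sum((n - 2.0 * c + p) ** 2 for p, c, n in zip(prev, cur, nxt))
    return total / (len(centers) - 2)


# Hypothetical tracks: an object moving away at constant speed, and the same
# track with jittery per-frame depth estimates.
smooth_track = [(0.0, 0.0, 10.0 + t) for t in range(5)]
noisy_track = [
    (0.0, 0.0, 10.0),
    (0.0, 0.0, 11.5),
    (0.0, 0.0, 11.8),
    (0.0, 0.0, 13.2),
    (0.0, 0.0, 14.0),
]

print(temporal_consistency_loss(smooth_track))
print(temporal_consistency_loss(noisy_track))
```

Under these assumptions the constant-velocity track incurs no penalty, while the jittery one does. A term of this kind could in principle be minimized alongside image-based terms to pull noisy per-frame poses toward a physically plausible trajectory.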

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2203.02193 [cs.CV]
	(or arXiv:2203.02193v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2203.02193
Related DOI:	https://doi.org/10.1109/LRA.2022.3188882

Submission history

From: Issa Mouawad
[v1] Fri, 4 Mar 2022 08:55:49 UTC (7,038 KB)
