DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds

Ma, Tao; Yang, Xuemeng; Zhou, Hongbin; Li, Xin; Shi, Botian; Liu, Junjie; Yang, Yuchen; Liu, Zhizheng; He, Liang; Qiao, Yu; Li, Yikang; Li, Hongsheng

arXiv:2306.06023 (cs)

[Submitted on 9 Jun 2023 (v1), last revised 17 Aug 2023 (this version, v2)]

Abstract: Existing offboard 3D detectors always follow a modular pipeline design to take advantage of unlimited sequential point clouds. We find that the full potential of offboard 3D detectors has not been explored, mainly for two reasons: (1) the onboard multi-object tracker cannot generate sufficient complete object trajectories, and (2) the motion state of objects poses an inevitable challenge for the object-centric refining stage in leveraging the long-term temporal context representation. To tackle these problems, we propose a novel paradigm of offboard 3D object detection, named DetZero. Concretely, an offline tracker coupled with a multi-frame detector is proposed to focus on the completeness of generated object tracks. An attention-mechanism refining module is proposed to strengthen contextual information interaction across long-term sequential point clouds for object refining with decomposed regression methods. Extensive experiments on the Waymo Open Dataset show that DetZero outperforms all state-of-the-art onboard and offboard 3D detection methods. Notably, DetZero ranks 1st on the Waymo 3D object detection leaderboard with 85.15 mAPH (L2) detection performance. Further experiments validate that such high-quality results can take the place of human labels. Our empirical study leads to a rethinking of conventions and to interesting findings that can guide future research on offboard 3D object detection.

Comments:	17 pages, 8 figures, accepted by ICCV 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2306.06023 [cs.CV]
	(or arXiv:2306.06023v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.06023

Submission history

From: Tao Ma
[v1] Fri, 9 Jun 2023 16:42:00 UTC (13,688 KB)
[v2] Thu, 17 Aug 2023 08:37:46 UTC (13,689 KB)
