Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous Driving

Zhu, Zijian; Zhang, Yichi; Chen, Hai; Dong, Yinpeng; Zhao, Shu; Ding, Wenbo; Zhong, Jiachen; Zheng, Shibao

Computer Science > Computer Vision and Pattern Recognition

arXiv:2303.17297v1 (cs)

[Submitted on 30 Mar 2023 (this version), latest version 16 Sep 2023 (v2)]

Title:Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous Driving

Authors:Zijian Zhu, Yichi Zhang, Hai Chen, Yinpeng Dong, Shu Zhao, Wenbo Ding, Jiachen Zhong, Shibao Zheng

View PDF

Abstract:3D object detection is an essential perception task in autonomous driving to understand the environments. The Bird's-Eye-View (BEV) representations have significantly improved the performance of 3D detectors with camera inputs on popular benchmarks. However, there still lacks a systematic understanding of the robustness of these vision-dependent BEV models, which is closely related to the safety of autonomous driving systems. In this paper, we evaluate the natural and adversarial robustness of various representative models under extensive settings, to fully understand their behaviors influenced by explicit BEV features compared with those without BEV. In addition to the classic settings, we propose a 3D consistent patch attack by applying adversarial patches in the 3D space to guarantee the spatiotemporal consistency, which is more realistic for the scenario of autonomous driving. With substantial experiments, we draw several findings: 1) BEV models tend to be more stable than previous methods under different natural conditions and common corruptions due to the expressive spatial representations; 2) BEV models are more vulnerable to adversarial noises, mainly caused by the redundant BEV features; 3) Camera-LiDAR fusion models have superior performance under different settings with multi-modal inputs, but BEV fusion model is still vulnerable to adversarial noises of both point cloud and image. These findings alert the safety issue in the applications of BEV detectors and could facilitate the development of more robust models.

Comments:	8 pages, CVPR2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Cryptography and Security (cs.CR)
Cite as:	arXiv:2303.17297 [cs.CV]
	(or arXiv:2303.17297v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2303.17297

Submission history

From: Zijian Zhu [view email]
[v1] Thu, 30 Mar 2023 11:16:58 UTC (5,006 KB)
[v2] Sat, 16 Sep 2023 12:42:11 UTC (5,007 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous Driving

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Understanding the Robustness of 3D Object Detection with Bird's-Eye-View Representations in Autonomous Driving

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators