Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features

Lu, Hsin-Cheng; Lin, Chung-Yi; Hsu, Winston H.

Computer Science > Computer Vision and Pattern Recognition

arXiv:2502.02322 (cs)

[Submitted on 4 Feb 2025]

Title:Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features

Authors:Hsin-Cheng Lu, Chung-Yi Lin, Winston H. Hsu

View PDF HTML (experimental)

Abstract:In autonomous driving, 3D object detection is essential for accurately identifying and tracking objects. Despite the continuous development of various technologies for this task, a significant drawback is observed in most of them-they experience substantial performance degradation when detecting objects in unseen domains. In this paper, we propose a method to improve the generalization ability for 3D object detection on a single domain. We primarily focus on generalizing from a single source domain to target domains with distinct sensor configurations and scene distributions. To learn sparsity-invariant features from a single source domain, we selectively subsample the source data to a specific beam, using confidence scores determined by the current detector to identify the density that holds utmost importance for the detector. Subsequently, we employ the teacher-student framework to align the Bird's Eye View (BEV) features for different point clouds densities. We also utilize feature content alignment (FCA) and graph-based embedding relationship alignment (GERA) to instruct the detector to be domain-agnostic. Extensive experiments demonstrate that our method exhibits superior generalization capabilities compared to other baselines. Furthermore, our approach even outperforms certain domain adaptation methods that can access to the target domain data.

Comments:	Accepted to ICRA 2025. Code is available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2502.02322 [cs.CV]
	(or arXiv:2502.02322v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2502.02322

Submission history

From: Hsin-Cheng Lu [view email]
[v1] Tue, 4 Feb 2025 13:47:02 UTC (1,584 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Improving Generalization Ability for 3D Object Detection by Learning Sparsity-invariant Features

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators