Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement

Li, Wenxuan; Zou, Qin; Chen, Chi; Du, Bo; Chen, Long; Zhou, Jian; Yu, Hongkai

Computer Science > Computer Vision and Pattern Recognition

arXiv:2408.07999 (cs)

[Submitted on 15 Aug 2024 (v1), last revised 15 Nov 2024 (this version, v2)]

Title:Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement

Authors:Wenxuan Li, Qin Zou, Chi Chen, Bo Du, Long Chen, Jian Zhou, Hongkai Yu

View PDF HTML (experimental)

Abstract:3D object detection in driving scenarios faces the challenge of complex road environments, which can lead to the loss or incompleteness of key features, thereby affecting perception performance. To address this issue, we propose an advanced detection framework called Co-Fix3D. Co-Fix3D integrates Local and Global Enhancement (LGE) modules to refine Bird's Eye View (BEV) features. The LGE module uses Discrete Wavelet Transform (DWT) for pixel-level local optimization and incorporates an attention mechanism for global optimization. To handle varying detection difficulties, we adopt multi-head LGE modules, enabling each module to focus on targets with different levels of detection complexity, thus further enhancing overall perception capability. Experimental results show that on the nuScenes dataset's LiDAR benchmark, Co-Fix3D achieves 69.4\% mAP and 73.5\% NDS, while on the multimodal benchmark, it achieves 72.3\% mAP and 74.7\% NDS. The source code is publicly available at \href{this https URL}{this https URL}.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2408.07999 [cs.CV]
	(or arXiv:2408.07999v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2408.07999

Submission history

From: Wenxuan Li [view email]
[v1] Thu, 15 Aug 2024 07:56:02 UTC (2,570 KB)
[v2] Fri, 15 Nov 2024 04:09:34 UTC (1,359 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators