SegVoxelNet: Exploring Semantic Context and Depth-aware Features for 3D Vehicle Detection from Point Cloud

Yi, Hongwei; Shi, Shaoshuai; Ding, Mingyu; Sun, Jiankai; Xu, Kui; Zhou, Hui; Wang, Zhe; Li, Sheng; Wang, Guoping

Computer Science > Computer Vision and Pattern Recognition

arXiv:2002.05316 (cs)

[Submitted on 13 Feb 2020]

Abstract: 3D vehicle detection from point clouds is a challenging task in real-world applications such as autonomous driving. Although significant progress has been made, we observe two aspects that can be further improved. First, the semantic context information in LiDAR data is seldom explored in previous work, although it may help identify ambiguous vehicles. Second, the distribution of points on vehicles varies continuously with increasing depth, which may not be well modeled by a single model. In this work, we propose a unified model, SegVoxelNet, to address these two problems. A semantic context encoder is proposed to leverage the free-of-charge semantic segmentation masks in the bird's eye view. Suspicious regions can be highlighted while noisy regions are suppressed by this module. To better handle vehicles at different depths, a novel depth-aware head is designed to explicitly model the distribution differences, with each part of the head focusing on its own target detection range. Extensive experiments on the KITTI dataset show that the proposed method outperforms state-of-the-art alternatives in both accuracy and efficiency using only point clouds as input.
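
The two ideas in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how the segmentation mask is applied or how depth ranges are chosen, so the multiplicative gating rule, the equal-width row split, the `overlap` parameter, and all function names below are assumptions made purely for illustration. The bird's eye view (BEV) map is modeled as a plain 2D list whose rows run from near to far.

```python
from typing import Callable, List, Sequence, Tuple

# BEV map: rows index depth (near to far), columns index lateral position.
Grid = List[List[float]]


def reweight_bev(features: Grid, mask_probs: Grid) -> Grid:
    """Scale each BEV cell by its foreground probability (assumed gating rule)."""
    return [[f * p for f, p in zip(f_row, p_row)]
            for f_row, p_row in zip(features, mask_probs)]


def depth_ranges(num_rows: int, num_parts: int, overlap: int = 0) -> List[Tuple[int, int]]:
    """Split row indices into contiguous [start, end) ranges, optionally overlapping."""
    step = num_rows // num_parts
    ranges = []
    for i in range(num_parts):
        start = max(0, i * step - overlap)
        # The last part absorbs any remainder rows.
        end = num_rows if i == num_parts - 1 else min(num_rows, (i + 1) * step + overlap)
        ranges.append((start, end))
    return ranges


def depth_aware_head(features: Grid,
                     heads: Sequence[Callable[[Grid], Grid]],
                     overlap: int = 0) -> List[Grid]:
    """Apply one head per depth range, so each head only sees its own slice."""
    ranges = depth_ranges(len(features), len(heads), overlap)
    return [head(features[start:end]) for head, (start, end) in zip(heads, ranges)]
```

In this sketch, cells with low predicted foreground probability are damped before detection, and each callable in `heads` stands in for a separate detection branch that would be trained only on targets within its own depth range.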

Comments:	Accepted by ICRA2020
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2002.05316 [cs.CV]
	(or arXiv:2002.05316v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2002.05316

Submission history

From: Hongwei Yi
[v1] Thu, 13 Feb 2020 02:42:31 UTC (5,577 KB)