LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes

Xu, Xiang; Kong, Lingdong; Shuai, Hui; Pan, Liang; Liu, Ziwei; Liu, Qingshan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2501.04004 (cs)

[Submitted on 7 Jan 2025]

Title:LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes

Authors:Xiang Xu, Lingdong Kong, Hui Shuai, Liang Pan, Ziwei Liu, Qingshan Liu

View PDF HTML (experimental)

Abstract:LiDAR data pretraining offers a promising approach to leveraging large-scale, readily available datasets for enhanced data utilization. However, existing methods predominantly focus on sparse voxel representation, overlooking the complementary attributes provided by other LiDAR representations. In this work, we propose LiMoE, a framework that integrates the Mixture of Experts (MoE) paradigm into LiDAR data representation learning to synergistically combine multiple representations, such as range images, sparse voxels, and raw points. Our approach consists of three stages: i) Image-to-LiDAR Pretraining, which transfers prior knowledge from images to point clouds across different representations; ii) Contrastive Mixture Learning (CML), which uses MoE to adaptively activate relevant attributes from each representation and distills these mixed features into a unified 3D network; iii) Semantic Mixture Supervision (SMS), which combines semantic logits from multiple representations to boost downstream segmentation performance. Extensive experiments across 11 large-scale LiDAR datasets demonstrate our effectiveness and superiority. The code and model checkpoints have been made publicly accessible.

Comments:	Preprint; 26 pages, 17 figures, 7 tables; Project Page at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2501.04004 [cs.CV]
	(or arXiv:2501.04004v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.04004

Submission history

From: Lingdong Kong [view email]
[v1] Tue, 7 Jan 2025 18:59:58 UTC (9,214 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LiMoE: Mixture of LiDAR Representation Learners from Automotive Scenes

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators