Towards Realistic Scene Generation with LiDAR Diffusion Models

Ran, Haoxi; Guizilini, Vitor; Wang, Yue

Computer Science > Computer Vision and Pattern Recognition

arXiv:2404.00815 (cs)

[Submitted on 31 Mar 2024 (v1), last revised 18 Apr 2024 (this version, v2)]

Title:Towards Realistic Scene Generation with LiDAR Diffusion Models

Authors:Haoxi Ran, Vitor Guizilini, Yue Wang

View PDF HTML (experimental)

Abstract:Diffusion models (DMs) excel in photo-realistic image synthesis, but their adaptation to LiDAR scene generation poses a substantial hurdle. This is primarily because DMs operating in the point space struggle to preserve the curve-like patterns and 3D geometry of LiDAR scenes, which consumes much of their representation power. In this paper, we propose LiDAR Diffusion Models (LiDMs) to generate LiDAR-realistic scenes from a latent space tailored to capture the realism of LiDAR scenes by incorporating geometric priors into the learning pipeline. Our method targets three major desiderata: pattern realism, geometry realism, and object realism. Specifically, we introduce curve-wise compression to simulate real-world LiDAR patterns, point-wise coordinate supervision to learn scene geometry, and patch-wise encoding for a full 3D object context. With these three core designs, our method achieves competitive performance on unconditional LiDAR generation in 64-beam scenario and state of the art on conditional LiDAR generation, while maintaining high efficiency compared to point-based DMs (up to 107$\times$ faster). Furthermore, by compressing LiDAR scenes into a latent space, we enable the controllability of DMs with various conditions such as semantic maps, camera views, and text prompts.

Comments:	CVPR 2024. Project link: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2404.00815 [cs.CV]
	(or arXiv:2404.00815v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2404.00815

Submission history

From: Haoxi Ran [view email]
[v1] Sun, 31 Mar 2024 22:18:56 UTC (11,553 KB)
[v2] Thu, 18 Apr 2024 19:22:37 UTC (17,614 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Realistic Scene Generation with LiDAR Diffusion Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Towards Realistic Scene Generation with LiDAR Diffusion Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators