Zero-Shot 4D Lidar Panoptic Segmentation

Zhang, Yushan; Ošep, Aljoša; Leal-Taixé, Laura; Meinhardt, Tim

arXiv:2504.00848 (cs)

[Submitted on 1 Apr 2025]

Abstract: Zero-shot 4D segmentation and recognition of arbitrary objects in Lidar is crucial for embodied navigation, with applications ranging from streaming perception to semantic mapping and localization. However, the primary challenge in advancing research and developing generalized, versatile methods for spatio-temporal scene understanding in Lidar lies in the scarcity of datasets that provide the necessary diversity and scale of annotations. To overcome these challenges, we propose SAL-4D (Segment Anything in Lidar--4D), a method that uses multi-modal robotic sensor setups as a bridge to distill recent developments in Video Object Segmentation (VOS), in conjunction with off-the-shelf Vision-Language foundation models, to Lidar. We use VOS models to pseudo-label tracklets in short video sequences, annotate these tracklets with sequence-level CLIP tokens, and lift them to the 4D Lidar space using calibrated multi-modal sensory setups to distill them to our SAL-4D model. Thanks to temporally consistent predictions, we outperform prior art in 3D Zero-Shot Lidar Panoptic Segmentation (LPS) by over $5$ PQ, and unlock Zero-Shot 4D-LPS.
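The lifting step mentioned in the abstract (transferring image-space tracklet pseudo-labels to Lidar points through a calibrated camera) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a simple pinhole camera without distortion, ignores occlusion handling, and all names (`lift_mask_to_points`, `lift_sequence`, the mask layout with 0 meaning "unlabeled") are hypothetical choices made for illustration.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]
Mask = List[List[int]]  # mask[v][u] = tracklet id; 0 means unlabeled (assumption)


def lift_mask_to_points(
    points: Sequence[Point],
    mask: Mask,
    intrinsics: Tuple[float, float, float, float],  # fx, fy, cx, cy
    lidar_to_cam: Sequence[Sequence[float]],        # 3x4 matrix [R | t]
) -> List[int]:
    """Return one tracklet id per Lidar point (0 = no pseudo-label)."""
    fx, fy, cx, cy = intrinsics
    height, width = len(mask), len(mask[0])
    labels = []
    for x, y, z in points:
        # Transform the point from the Lidar frame into the camera frame.
        xc, yc, zc = (r[0] * x + r[1] * y + r[2] * z + r[3] for r in lidar_to_cam)
        if zc <= 0.0:
            # Points behind the camera cannot receive an image label.
            labels.append(0)
            continue
        # Pinhole projection to integer pixel coordinates.
        u = int(round(fx * xc / zc + cx))
        v = int(round(fy * yc / zc + cy))
        inside = 0 <= u < width and 0 <= v < height
        labels.append(mask[v][u] if inside else 0)
    return labels


def lift_sequence(
    scans: Sequence[Sequence[Point]],
    masks: Sequence[Mask],
    intrinsics: Tuple[float, float, float, float],
    lidar_to_cam: Sequence[Sequence[float]],
) -> List[List[int]]:
    """Lift per-frame masks to per-scan labels.

    Because a video tracklet keeps one id across frames, the lifted
    Lidar labels share that id over time.
    """
    return [
        lift_mask_to_points(scan, mask, intrinsics, lidar_to_cam)
        for scan, mask in zip(scans, masks)
    ]
```

In this sketch, temporal consistency comes entirely from the image-space tracklet ids: a point in scan t and a point in scan t+1 receive the same label whenever both project into the same tracklet's masks.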

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2504.00848 [cs.CV]
	(or arXiv:2504.00848v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2504.00848

Submission history

From: Yushan Zhang
[v1] Tue, 1 Apr 2025 14:36:12 UTC (24,384 KB)
