Instance Segmentation with Cross-Modal Consistency

Zhu, Alex Zihao; Casser, Vincent; Mahjourian, Reza; Kretzschmar, Henrik; Pirk, Sören

Computer Science > Computer Vision and Pattern Recognition

arXiv:2210.08113 (cs)

[Submitted on 14 Oct 2022]

Title:Instance Segmentation with Cross-Modal Consistency

Authors:Alex Zihao Zhu, Vincent Casser, Reza Mahjourian, Henrik Kretzschmar, Sören Pirk

View PDF

Abstract:Segmenting object instances is a key task in machine perception, with safety-critical applications in robotics and autonomous driving. We introduce a novel approach to instance segmentation that jointly leverages measurements from multiple sensor modalities, such as cameras and LiDAR. Our method learns to predict embeddings for each pixel or point that give rise to a dense segmentation of the scene. Specifically, our technique applies contrastive learning to points in the scene both across sensor modalities and the temporal domain. We demonstrate that this formulation encourages the models to learn embeddings that are invariant to viewpoint variations and consistent across sensor modalities. We further demonstrate that the embeddings are stable over time as objects move around the scene. This not only provides stable instance masks, but can also provide valuable signals to downstream tasks, such as object tracking. We evaluate our method on the Cityscapes and KITTI-360 datasets. We further conduct a number of ablation studies, demonstrating benefits when applying additional inputs for the contrastive loss.

Comments:	8 pages, 9 figures, 5 tables. Presented at IROS 2022
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2210.08113 [cs.CV]
	(or arXiv:2210.08113v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2210.08113

Submission history

From: Alex Zihao Zhu [view email]
[v1] Fri, 14 Oct 2022 21:17:19 UTC (3,721 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Instance Segmentation with Cross-Modal Consistency

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Instance Segmentation with Cross-Modal Consistency

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators