Self-Supervised Learning of Object Segmentation from Unlabeled RGB-D Videos

Lu, Shiyang; Deng, Yunfu; Boularias, Abdeslam; Bekris, Kostas

Computer Science > Computer Vision and Pattern Recognition

arXiv:2304.04325 (cs)

[Submitted on 9 Apr 2023]

Title:Self-Supervised Learning of Object Segmentation from Unlabeled RGB-D Videos

Authors:Shiyang Lu, Yunfu Deng, Abdeslam Boularias, Kostas Bekris

View PDF

Abstract:This work proposes a self-supervised learning system for segmenting rigid objects in RGB images. The proposed pipeline is trained on unlabeled RGB-D videos of static objects, which can be captured with a camera carried by a mobile robot. A key feature of the self-supervised training process is a graph-matching algorithm that operates on the over-segmentation output of the point cloud that is reconstructed from each video. The graph matching, along with point cloud registration, is able to find reoccurring object patterns across videos and combine them into 3D object pseudo labels, even under occlusions or different viewing angles. Projected 2D object masks from 3D pseudo labels are used to train a pixel-wise feature extractor through contrastive learning. During online inference, a clustering method uses the learned features to cluster foreground pixels into object segments. Experiments highlight the method's effectiveness on both real and synthetic video datasets, which include cluttered scenes of tabletop objects. The proposed method outperforms existing unsupervised methods for object segmentation by a large margin.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO)
Cite as:	arXiv:2304.04325 [cs.CV]
	(or arXiv:2304.04325v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2304.04325

Submission history

From: Shiyang Lu [view email]
[v1] Sun, 9 Apr 2023 23:13:39 UTC (4,597 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Self-Supervised Learning of Object Segmentation from Unlabeled RGB-D Videos

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Self-Supervised Learning of Object Segmentation from Unlabeled RGB-D Videos

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators