Multi-body SE(3) Equivariance for Unsupervised Rigid Segmentation and Motion Estimation

Zhong, Jia-Xing; Cheng, Ta-Ying; He, Yuhang; Lu, Kai; Zhou, Kaichen; Markham, Andrew; Trigoni, Niki

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.05584 (cs)

[Submitted on 8 Jun 2023 (v1), last revised 31 Oct 2023 (this version, v2)]

Title:Multi-body SE(3) Equivariance for Unsupervised Rigid Segmentation and Motion Estimation

Authors:Jia-Xing Zhong, Ta-Ying Cheng, Yuhang He, Kai Lu, Kaichen Zhou, Andrew Markham, Niki Trigoni

View PDF

Abstract:A truly generalizable approach to rigid segmentation and motion estimation is fundamental to 3D understanding of articulated objects and moving scenes. In view of the closely intertwined relationship between segmentation and motion estimates, we present an SE(3) equivariant architecture and a training strategy to tackle this task in an unsupervised manner. Our architecture is composed of two interconnected, lightweight heads. These heads predict segmentation masks using point-level invariant features and estimate motion from SE(3) equivariant features, all without the need for category information. Our training strategy is unified and can be implemented online, which jointly optimizes the predicted segmentation and motion by leveraging the interrelationships among scene flow, segmentation mask, and rigid transformations. We conduct experiments on four datasets to demonstrate the superiority of our method. The results show that our method excels in both model performance and computational efficiency, with only 0.25M parameters and 0.92G FLOPs. To the best of our knowledge, this is the first work designed for category-agnostic part-level SE(3) equivariance in dynamic point clouds.

Comments:	To appear at NeurIPS 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG); Multimedia (cs.MM)
Cite as:	arXiv:2306.05584 [cs.CV]
	(or arXiv:2306.05584v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.05584

Submission history

From: Jia-Xing Zhong [view email]
[v1] Thu, 8 Jun 2023 22:55:32 UTC (1,517 KB)
[v2] Tue, 31 Oct 2023 13:46:52 UTC (4,916 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Multi-body SE(3) Equivariance for Unsupervised Rigid Segmentation and Motion Estimation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Multi-body SE(3) Equivariance for Unsupervised Rigid Segmentation and Motion Estimation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators