S$^2$-Diffusion: Generalizing from Instance-level to Category-level Skills in Robot Manipulation

Yang, Quantao; Welle, Michael C.; Kragic, Danica; Andersson, Olov

Computer Science > Robotics

arXiv:2502.09389 (cs)

[Submitted on 13 Feb 2025 (v1), last revised 17 Feb 2025 (this version, v2)]

Title:S$^2$-Diffusion: Generalizing from Instance-level to Category-level Skills in Robot Manipulation

Authors:Quantao Yang, Michael C. Welle, Danica Kragic, Olov Andersson

View PDF HTML (experimental)

Abstract:Recent advances in skill learning has propelled robot manipulation to new heights by enabling it to learn complex manipulation tasks from a practical number of demonstrations. However, these skills are often limited to the particular action, object, and environment \textit{instances} that are shown in the training data, and have trouble transferring to other instances of the same category. In this work we present an open-vocabulary Spatial-Semantic Diffusion policy (S$^2$-Diffusion) which enables generalization from instance-level training data to category-level, enabling skills to be transferable between instances of the same category. We show that functional aspects of skills can be captured via a promptable semantic module combined with a spatial representation. We further propose leveraging depth estimation networks to allow the use of only a single RGB camera. Our approach is evaluated and compared on a diverse number of robot manipulation tasks, both in simulation and in the real world. Our results show that S$^2$-Diffusion is invariant to changes in category-irrelevant factors as well as enables satisfying performance on other instances within the same category, even if it was not trained on that specific instance. Full videos of all real-world experiments are available in the supplementary material.

Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2502.09389 [cs.RO]
	(or arXiv:2502.09389v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2502.09389

Submission history

From: Quantao Yang [view email]
[v1] Thu, 13 Feb 2025 15:06:42 UTC (13,580 KB)
[v2] Mon, 17 Feb 2025 08:38:28 UTC (13,580 KB)

Computer Science > Robotics

Title:S$^2$-Diffusion: Generalizing from Instance-level to Category-level Skills in Robot Manipulation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:S$^2$-Diffusion: Generalizing from Instance-level to Category-level Skills in Robot Manipulation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators