Learning Language-Conditioned Deformable Object Manipulation with Graph Dynamics

Deng, Yuhong; Mo, Kai; Xia, Chongkun; Wang, Xueqian

Computer Science > Robotics

arXiv:2303.01310 (cs)

[Submitted on 2 Mar 2023 (v1), last revised 29 Jan 2024 (this version, v3)]

Title:Learning Language-Conditioned Deformable Object Manipulation with Graph Dynamics

Authors:Yuhong Deng, Kai Mo, Chongkun Xia, Xueqian Wang

View PDF

Abstract:Multi-task learning of deformable object manipulation is a challenging problem in robot manipulation. Most previous works address this problem in a goal-conditioned way and adapt goal images to specify different tasks, which limits the multi-task learning performance and can not generalize to new tasks. Thus, we adapt language instruction to specify deformable object manipulation tasks and propose a learning framework. We first design a unified Transformer-based architecture to understand multi-modal data and output picking and placing action. Besides, we have introduced the visible connectivity graph to tackle nonlinear dynamics and complex configuration of the deformable object. Both simulated and real experiments have demonstrated that the proposed method is effective and can generalize to unseen instructions and tasks. Compared with the state-of-the-art method, our method achieves higher success rates (87.2% on average) and has a 75.6% shorter inference time. We also demonstrate that our method performs well in real-world experiments.

Comments:	has been accepted by ICRA 2024
Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2303.01310 [cs.RO]
	(or arXiv:2303.01310v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2303.01310

Submission history

From: Yuhong Deng [view email]
[v1] Thu, 2 Mar 2023 14:34:22 UTC (787 KB)
[v2] Mon, 16 Oct 2023 11:57:32 UTC (3,129 KB)
[v3] Mon, 29 Jan 2024 12:07:47 UTC (3,129 KB)

Computer Science > Robotics

Title:Learning Language-Conditioned Deformable Object Manipulation with Graph Dynamics

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Learning Language-Conditioned Deformable Object Manipulation with Graph Dynamics

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators