Learning Coordinated Bimanual Manipulation Policies using State Diffusion and Inverse Dynamics Models

Chen, Haonan; Xu, Jiaming; Sheng, Lily; Ji, Tianchen; Liu, Shuijing; Li, Yunzhu; Driggs-Campbell, Katherine

Computer Science > Robotics

arXiv:2503.23271 (cs)

[Submitted on 30 Mar 2025]

Title:Learning Coordinated Bimanual Manipulation Policies using State Diffusion and Inverse Dynamics Models

Authors:Haonan Chen, Jiaming Xu, Lily Sheng, Tianchen Ji, Shuijing Liu, Yunzhu Li, Katherine Driggs-Campbell

View PDF HTML (experimental)

Abstract:When performing tasks like laundry, humans naturally coordinate both hands to manipulate objects and anticipate how their actions will change the state of the clothes. However, achieving such coordination in robotics remains challenging due to the need to model object movement, predict future states, and generate precise bimanual actions. In this work, we address these challenges by infusing the predictive nature of human manipulation strategies into robot imitation learning. Specifically, we disentangle task-related state transitions from agent-specific inverse dynamics modeling to enable effective bimanual coordination. Using a demonstration dataset, we train a diffusion model to predict future states given historical observations, envisioning how the scene evolves. Then, we use an inverse dynamics model to compute robot actions that achieve the predicted states. Our key insight is that modeling object movement can help learning policies for bimanual coordination manipulation tasks. Evaluating our framework across diverse simulation and real-world manipulation setups, including multimodal goal configurations, bimanual manipulation, deformable objects, and multi-object setups, we find that it consistently outperforms state-of-the-art state-to-action mapping policies. Our method demonstrates a remarkable capacity to navigate multimodal goal configurations and action distributions, maintain stability across different control modes, and synthesize a broader range of behaviors than those present in the demonstration dataset.

Comments:	Project Page: this https URL. 12 pages, 12 figures, Accepted at ICRA 2025
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2503.23271 [cs.RO]
	(or arXiv:2503.23271v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2503.23271

Submission history

From: Haonan Chen [view email]
[v1] Sun, 30 Mar 2025 01:25:35 UTC (44,202 KB)

Computer Science > Robotics

Title:Learning Coordinated Bimanual Manipulation Policies using State Diffusion and Inverse Dynamics Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Learning Coordinated Bimanual Manipulation Policies using State Diffusion and Inverse Dynamics Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators