TAX-Pose: Task-Specific Cross-Pose Estimation for Robot Manipulation

Pan, Chuer; Okorn, Brian; Zhang, Harry; Eisner, Ben; Held, David

arXiv:2211.09325 (cs)

[Submitted on 17 Nov 2022 (v1), last revised 2 May 2024 (this version, v3)]

Abstract: How do we imbue robots with the ability to efficiently manipulate unseen objects and transfer relevant skills based on demonstrations? End-to-end learning methods often fail to generalize to novel objects or unseen configurations. Instead, we focus on the task-specific pose relationship between relevant parts of interacting objects. We conjecture that this relationship is a generalizable notion of a manipulation task that can transfer to new objects in the same category; examples include the relationship between the pose of a pan relative to an oven or the pose of a mug relative to a mug rack. We call this task-specific pose relationship "cross-pose" and provide a mathematical definition of this concept. We propose a vision-based system that learns to estimate the cross-pose between two objects for a given manipulation task using learned cross-object correspondences. The estimated cross-pose is then used to guide a downstream motion planner to manipulate the objects into the desired pose relationship (placing a pan into the oven or the mug onto the mug rack). We demonstrate our method's capability to generalize to unseen objects, in some cases after training on only 10 demonstrations in the real world. Results show that our system achieves state-of-the-art performance in both simulated and real-world experiments across a number of tasks. Supplementary information and videos can be found at this https URL.
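
As a rough illustration of how a relative pose can be recovered once cross-object correspondences are available, the sketch below fits a rigid transform to paired 3D points with a weighted least-squares (Kabsch-style) SVD solve. This is an assumption made for illustration: the abstract does not say which solver the system uses, and the names `estimate_rigid_transform` and `demo`, as well as the synthetic data, are hypothetical and not taken from the paper.

```python
import numpy as np


def estimate_rigid_transform(src, dst, weights=None):
    """Least-squares rigid transform (R, t) such that R @ src[i] + t ~= dst[i].

    Illustrative only: src/dst stand in for corresponding points on the two
    objects; the weights are optional per-correspondence confidences.
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    if weights is None:
        weights = np.ones(len(src))
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()

    # Weighted centroids of both point sets.
    src_mean = w @ src
    dst_mean = w @ dst

    # Weighted cross-covariance of the centered points (3x3).
    src_c = src - src_mean
    dst_c = dst - dst_mean
    H = (src_c * w[:, None]).T @ dst_c

    # Kabsch: rotation from the SVD, with a reflection guard on the determinant.
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_mean - R @ src_mean
    return R, t


def demo():
    """Recover a known transform from noise-free synthetic correspondences."""
    rng = np.random.default_rng(0)
    src = rng.normal(size=(50, 3))
    angle = np.pi / 6
    c, s = np.cos(angle), np.sin(angle)
    R_true = np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])
    t_true = np.array([0.3, -0.1, 0.5])
    dst = src @ R_true.T + t_true
    R_est, t_est = estimate_rigid_transform(src, dst)
    return R_true, t_true, R_est, t_est
```

With exact correspondences, as in `demo`, the recovered rotation and translation match the ground truth up to floating-point error. With learned, noisy correspondences the same solve returns the best rigid fit in the weighted least-squares sense.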

Comments:	Conference on Robot Learning (CoRL), 2022. Supplementary material is available at this https URL
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2211.09325 [cs.RO]
	(or arXiv:2211.09325v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2211.09325

Submission history

From: Harry Zhang
[v1] Thu, 17 Nov 2022 04:06:16 UTC (25,845 KB)
[v2] Fri, 21 Apr 2023 03:10:27 UTC (25,975 KB)
[v3] Thu, 2 May 2024 16:04:19 UTC (44,988 KB)