Any-point Trajectory Modeling for Policy Learning

Wen, Chuan; Lin, Xingyu; So, John; Chen, Kai; Dou, Qi; Gao, Yang; Abbeel, Pieter

Computer Science > Robotics

arXiv:2401.00025v3 (cs)

[Submitted on 28 Dec 2023 (v1), last revised 12 Jul 2024 (this version, v3)]

Title:Any-point Trajectory Modeling for Policy Learning

Authors:Chuan Wen, Xingyu Lin, John So, Kai Chen, Qi Dou, Yang Gao, Pieter Abbeel

View PDF HTML (experimental)

Abstract:Learning from demonstration is a powerful method for teaching robots new skills, and having more demonstration data often improves policy learning. However, the high cost of collecting demonstration data is a significant bottleneck. Videos, as a rich data source, contain knowledge of behaviors, physics, and semantics, but extracting control-specific information from them is challenging due to the lack of action labels. In this work, we introduce a novel framework, Any-point Trajectory Modeling (ATM), that utilizes video demonstrations by pre-training a trajectory model to predict future trajectories of arbitrary points within a video frame. Once trained, these trajectories provide detailed control guidance, enabling the learning of robust visuomotor policies with minimal action-labeled data. Across over 130 language-conditioned tasks we evaluated in both simulation and the real world, ATM outperforms strong video pre-training baselines by 80% on average. Furthermore, we show effective transfer learning of manipulation skills from human videos and videos from a different robot morphology. Visualizations and code are available at: \url{this https URL}.

Comments:	18 pages, 15 figures
Subjects:	Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2401.00025 [cs.RO]
	(or arXiv:2401.00025v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2401.00025

Submission history

From: Chuan Wen [view email]
[v1] Thu, 28 Dec 2023 23:34:43 UTC (1,628 KB)
[v2] Fri, 16 Feb 2024 06:55:06 UTC (14,156 KB)
[v3] Fri, 12 Jul 2024 12:51:00 UTC (14,863 KB)

Computer Science > Robotics

Title:Any-point Trajectory Modeling for Policy Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Any-point Trajectory Modeling for Policy Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators