Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following

Yang, Yuxiao; Zhang, Shenao; Liu, Zhihan; Yao, Huaxiu; Wang, Zhaoran

arXiv:2412.19562 (cs)

[Submitted on 27 Dec 2024]

Abstract: This work focuses on building a task planner for Embodied Instruction Following (EIF) using Large Language Models (LLMs). Previous works typically train a planner to imitate expert trajectories, treating this as a supervised task. While these methods achieve competitive performance, they often lack sufficient robustness. When a suboptimal action is taken, the planner may encounter an out-of-distribution state, which can lead to task failure. In contrast, we frame the task as a Partially Observable Markov Decision Process (POMDP) and aim to develop a robust planner under a few-shot assumption. Thus, we propose a closed-loop planner with an adaptation module and a novel hindsight method, aiming to use as much information as possible to assist the planner. Our experiments on the ALFRED dataset indicate that our planner achieves competitive performance under a few-shot assumption. For the first time, our few-shot agent's performance approaches and even surpasses that of the full-shot supervised agent.
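
The contrast the abstract draws between a planner that commits to a fixed action sequence and a closed-loop planner that reacts to what it observes can be illustrated with a toy example. The sketch below is not the paper's method: it uses no LLM, no adaptation module, and no hindsight relabeling, and the environment is fully observable rather than a POMDP. Every name in it (GOAL, plan, step_env, run_open_loop, run_closed_loop) is hypothetical and chosen only for illustration. It shows how a single failed action leaves an open-loop executor stranded, while replanning from the latest observation recovers.

```python
from typing import List

GOAL = 3  # hypothetical goal position on a 1-D line


def plan(position: int) -> List[str]:
    """Toy planner: the list of moves that leads from `position` to GOAL."""
    move = "right" if position < GOAL else "left"
    return [move] * abs(GOAL - position)


def step_env(position: int, action: str, t: int) -> int:
    """Toy dynamics. At t == 1 the action fails and the agent is pushed
    one cell left, standing in for a suboptimal or failed action."""
    if t == 1:
        return position - 1
    return position + (1 if action == "right" else -1)


def run_open_loop(start: int, horizon: int) -> int:
    """Plan once from the start state and execute without looking back."""
    actions = plan(start)
    pos = start
    for t, action in enumerate(actions[:horizon]):
        pos = step_env(pos, action, t)
    return pos


def run_closed_loop(start: int, horizon: int) -> int:
    """Replan from the latest observed position before every step."""
    pos = start
    for t in range(horizon):
        remaining = plan(pos)
        if not remaining:  # goal reached, nothing left to do
            break
        pos = step_env(pos, remaining[0], t)
    return pos


if __name__ == "__main__":
    print("open-loop final position:  ", run_open_loop(0, 10))
    print("closed-loop final position:", run_closed_loop(0, 10))
```

In this toy setting the open-loop run ends at position 1, short of the goal, because its three planned moves are used up after the disturbance, whereas the closed-loop run reaches position 3.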

Subjects:	Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2412.19562 [cs.AI]
	(or arXiv:2412.19562v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2412.19562

Submission history

From: Yuxiao Yang
[v1] Fri, 27 Dec 2024 10:05:45 UTC (4,322 KB)
