MAGIC: Learning Macro-Actions for Online POMDP Planning

Lee, Yiyuan; Cai, Panpan; Hsu, David

Computer Science > Robotics

arXiv:2011.03813 (cs)

[Submitted on 7 Nov 2020 (v1), last revised 1 Jul 2021 (this version, v4)]

Title:MAGIC: Learning Macro-Actions for Online POMDP Planning

Authors:Yiyuan Lee, Panpan Cai, David Hsu

View PDF

Abstract:The partially observable Markov decision process (POMDP) is a principled general framework for robot decision making under uncertainty, but POMDP planning suffers from high computational complexity, when long-term planning is required. While temporally-extended macro-actions help to cut down the effective planning horizon and significantly improve computational efficiency, how do we acquire good macro-actions? This paper proposes Macro-Action Generator-Critic (MAGIC), which performs offline learning of macro-actions optimized for online POMDP planning. Specifically, MAGIC learns a macro-action generator end-to-end, using an online planner's performance as the feedback. During online planning, the generator generates on the fly situation-aware macro-actions conditioned on the robot's belief and the environment context. We evaluated MAGIC on several long-horizon planning tasks both in simulation and on a real robot. The experimental results show that the learned macro-actions offer significant benefits in online planning performance, compared with primitive actions and handcrafted macro-actions.

Comments:	9 pages (+ 2 page references, + 2 page appendix)
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2011.03813 [cs.RO]
	(or arXiv:2011.03813v4 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2011.03813

Submission history

From: Yiyuan Lee [view email]
[v1] Sat, 7 Nov 2020 17:18:45 UTC (1,293 KB)
[v2] Mon, 26 Apr 2021 12:45:54 UTC (3,694 KB)
[v3] Wed, 30 Jun 2021 16:20:56 UTC (5,604 KB)
[v4] Thu, 1 Jul 2021 06:04:09 UTC (5,603 KB)

Computer Science > Robotics

Title:MAGIC: Learning Macro-Actions for Online POMDP Planning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:MAGIC: Learning Macro-Actions for Online POMDP Planning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators