Energy-Based Hindsight Experience Prioritization

Zhao, Rui; Tresp, Volker


arXiv:1810.01363 (cs)

[Submitted on 2 Oct 2018 (v1), last revised 24 May 2020 (this version, v5)]

Abstract: In Hindsight Experience Replay (HER), a reinforcement learning agent is trained by treating whatever it has achieved as virtual goals. However, in previous work, the experience was replayed at random, without considering which episode might be the most valuable for learning. In this paper, we develop an energy-based framework for prioritizing hindsight experience in robotic manipulation tasks. Our approach is inspired by the work-energy principle in physics. We define a trajectory energy function as the sum of the transition energy of the target object over the trajectory. We hypothesize that replaying episodes that have high trajectory energy is more effective for reinforcement learning in robotics. To verify our hypothesis, we design a framework for hindsight experience prioritization based on the trajectory energy of goal states. The trajectory energy function takes potential, kinetic, and rotational energy into account. We evaluate our Energy-Based Prioritization (EBP) approach on four challenging robotic manipulation tasks in simulation. Our empirical results show that our proposed method surpasses state-of-the-art approaches in terms of both performance and sample-efficiency on all four tasks, without increasing computational time. A video showing experimental results is available at this https URL
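
The prioritization idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract states only that the trajectory energy is the sum of the object's transition energies, and that potential, kinetic, and rotational energy are considered. Everything else here is an assumption made for illustration: the function names `trajectory_energy` and `sample_episode`, the `mass`, `dt`, and `e_max` parameters, velocity estimated by finite differences, transition energy taken as the clipped non-negative increase in energy between consecutive steps, sampling probability proportional to trajectory energy, and the omission of rotational energy.

```python
import random

G = 9.81  # gravitational acceleration in m/s^2


def trajectory_energy(positions, mass=1.0, dt=0.04, e_max=None):
    """Sum the non-negative energy increases of the object over one episode.

    positions: list of (x, y, z) object positions, one per timestep.
    Assumptions (not taken from the abstract): finite-difference velocity,
    transition energy = increase in potential + kinetic energy clipped to
    [0, e_max], rotational energy left out for brevity.
    """
    energies = []
    prev = positions[0]
    for pos in positions:
        # Squared speed from the displacement since the previous timestep.
        speed_sq = sum(((a - b) / dt) ** 2 for a, b in zip(pos, prev))
        potential = mass * G * pos[2]
        kinetic = 0.5 * mass * speed_sq
        energies.append(potential + kinetic)
        prev = pos

    total = 0.0
    for e_prev, e_next in zip(energies, energies[1:]):
        delta = max(e_next - e_prev, 0.0)  # count only energy gained
        if e_max is not None:
            delta = min(delta, e_max)  # optional upper clip
        total += delta
    return total


def sample_episode(episodes, rng=random):
    """Pick one episode with probability proportional to its energy."""
    weights = [trajectory_energy(ep) for ep in episodes]
    if sum(weights) == 0:
        # No episode moved the object: fall back to uniform sampling.
        return rng.choice(episodes)
    return rng.choices(episodes, weights=weights, k=1)[0]
```

Under these assumptions, an episode in which the object never moves has zero trajectory energy and is replayed only when no episode in the buffer moved the object, while an episode in which the object is lifted or pushed is sampled more often.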

Comments:	Published in Conference on Robot Learning (CoRL 2018) as oral presentation (7%), Zurich, Switzerland
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:1810.01363 [cs.LG]
	(or arXiv:1810.01363v5 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.01363
Journal reference:	PMLR 87:113-122, 2018

Submission history

From: Rui Zhao
[v1] Tue, 2 Oct 2018 16:42:35 UTC (2,446 KB)
[v2] Wed, 3 Oct 2018 08:04:51 UTC (2,446 KB)
[v3] Mon, 8 Oct 2018 14:44:40 UTC (2,446 KB)
[v4] Wed, 20 Feb 2019 10:15:33 UTC (2,446 KB)
[v5] Sun, 24 May 2020 07:57:13 UTC (2,447 KB)
