Learning a Skill-sequence-dependent Policy for Long-horizon Manipulation Tasks

Li, Zhihao; Sun, Zhenglong; SU, Jionglong; Zhang, Jiaming

arXiv:2105.05484 (cs)

[Submitted on 12 May 2021]

Abstract: In recent years, the robotics community has made substantial progress in robotic manipulation using deep reinforcement learning (RL), but effectively learning long-horizon tasks remains challenging. Typical RL-based methods approximate long-horizon tasks as Markov decision processes and consider only the current observation (images or other sensor information) as the input state. However, such an approximation ignores the fact that the skill sequence also plays a crucial role in long-horizon tasks. In this paper, we take both the observation and the skill sequence into account and propose a skill-sequence-dependent hierarchical policy for solving a typical long-horizon task. The proposed policy consists of a high-level skill policy (utilizing skill sequences) and a low-level parameter policy (responding to observations), each with a corresponding training method, which makes learning much more sample-efficient. Experiments in simulation demonstrate that our approach successfully solves a long-horizon task and is significantly faster than Proximal Policy Optimization (PPO) and the task schema methods.
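
The two-level structure described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the skill names, the tabular preference update, and the classes `SkillPolicy` and `ParameterPolicy` are hypothetical stand-ins chosen only to show a high-level policy that conditions on the skill sequence and a low-level policy that conditions on the observation.

```python
import random
from typing import Dict, List, Sequence, Tuple

# Hypothetical skill set; the paper's actual skills are not listed in the abstract.
SKILLS = ["reach", "grasp", "move", "place"]


class SkillPolicy:
    """High-level policy: chooses the next skill from the skill history alone."""

    def __init__(self, skills: Sequence[str], seed: int = 0) -> None:
        self.skills = list(skills)
        # Preference table keyed by the skill sequence executed so far.
        self.prefs: Dict[Tuple[str, ...], Dict[str, float]] = {}
        self.rng = random.Random(seed)

    def _table(self, history: Sequence[str]) -> Dict[str, float]:
        return self.prefs.setdefault(tuple(history), {s: 0.0 for s in self.skills})

    def next_skill(self, history: Sequence[str], epsilon: float = 0.0) -> str:
        table = self._table(history)
        if self.rng.random() < epsilon:  # optional exploration
            return self.rng.choice(self.skills)
        return max(self.skills, key=lambda s: table[s])

    def update(self, history: Sequence[str], skill: str, reward: float, lr: float = 0.5) -> None:
        # Simple running-average update (an assumption, not the paper's training method).
        table = self._table(history)
        table[skill] += lr * (reward - table[skill])


class ParameterPolicy:
    """Low-level policy: maps the current observation to parameters for a skill."""

    def parameters(self, skill: str, observation: Sequence[float]) -> Tuple[float, ...]:
        # Placeholder: use the observed position directly as the skill target.
        return tuple(observation)


def rollout(skill_policy: SkillPolicy, param_policy: ParameterPolicy,
            observation: Sequence[float], horizon: int) -> List[Tuple[str, Tuple[float, ...]]]:
    """Alternate between choosing a skill (from history) and its parameters (from observation)."""
    history: List[str] = []
    plan = []
    for _ in range(horizon):
        skill = skill_policy.next_skill(history)
        plan.append((skill, param_policy.parameters(skill, observation)))
        history.append(skill)
    return plan
```

In this sketch, the high-level choice never looks at the observation and the low-level choice never looks at the skill history, which mirrors the division of roles stated in the abstract; a real system would replace both placeholders with learned models.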

Subjects:	Robotics (cs.RO)
Cite as:	arXiv:2105.05484 [cs.RO]
	(or arXiv:2105.05484v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2105.05484

Submission history

From: Zhihao Li
[v1] Wed, 12 May 2021 07:46:56 UTC (1,695 KB)
