Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning

Bellegarda, Guillaume; Chen, Yiyu; Liu, Zhuochen; Nguyen, Quan

doi:10.1109/IROS47612.2022.9982132

Computer Science > Robotics

arXiv:2103.06484 (cs)

[Submitted on 11 Mar 2021 (v1), last revised 15 Mar 2023 (this version, v2)]

Title:Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning

Authors:Guillaume Bellegarda, Yiyu Chen, Zhuochen Liu, Quan Nguyen

View PDF

Abstract:Deep reinforcement learning has emerged as a popular and powerful way to develop locomotion controllers for quadruped robots. Common approaches have largely focused on learning actions directly in joint space, or learning to modify and offset foot positions produced by trajectory generators. Both approaches typically require careful reward shaping and training for millions of time steps, and with trajectory generators introduce human bias into the resulting control policies. In this paper, we present a learning framework that leads to the natural emergence of fast and robust bounding policies for quadruped robots. The agent both selects and controls actions directly in task space to track desired velocity commands subject to environmental noise including model uncertainty and rough terrain. We observe that this framework improves sample efficiency, necessitates little reward shaping, leads to the emergence of natural gaits such as galloping and bounding, and eases the sim-to-real transfer at running speeds. Policies can be learned in only a few million time steps, even for challenging tasks of running over rough terrain with loads of over 100% of the nominal quadruped mass. Training occurs in PyBullet, and we perform a sim-to-sim transfer to Gazebo and sim-to-real transfer to the Unitree A1 hardware. For sim-to-sim, our results show the quadruped is able to run at over 4 m/s without a load, and 3.5 m/s with a 10 kg load, which is over 83% of the nominal quadruped mass. For sim-to-real, the Unitree A1 is able to bound at 2 m/s with a 5 kg load, representing 42% of the nominal quadruped mass.

Subjects:	Robotics (cs.RO); Machine Learning (cs.LG); Systems and Control (eess.SY)
Cite as:	arXiv:2103.06484 [cs.RO]
	(or arXiv:2103.06484v2 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2103.06484
Journal reference:	2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 2022, pp. 10364-10370
Related DOI:	https://doi.org/10.1109/IROS47612.2022.9982132

Submission history

From: Guillaume Bellegarda [view email]
[v1] Thu, 11 Mar 2021 06:13:09 UTC (4,531 KB)
[v2] Wed, 15 Mar 2023 13:22:11 UTC (5,431 KB)

Computer Science > Robotics

Title:Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators