Accelerating Reinforcement Learning of Robotic Manipulations via Feedback from Large Language Models

Chu, Kun; Zhao, Xufeng; Weber, Cornelius; Li, Mengdi; Wermter, Stefan

Computer Science > Robotics

arXiv:2311.02379 (cs)

[Submitted on 4 Nov 2023]

Title:Accelerating Reinforcement Learning of Robotic Manipulations via Feedback from Large Language Models

Authors:Kun Chu, Xufeng Zhao, Cornelius Weber, Mengdi Li, Stefan Wermter

View PDF

Abstract:Reinforcement Learning (RL) plays an important role in the robotic manipulation domain since it allows self-learning from trial-and-error interactions with the environment. Still, sample efficiency and reward specification seriously limit its potential. One possible solution involves learning from expert guidance. However, obtaining a human expert is impractical due to the high cost of supervising an RL agent, and developing an automatic supervisor is a challenging endeavor. Large Language Models (LLMs) demonstrate remarkable abilities to provide human-like feedback on user inputs in natural language. Nevertheless, they are not designed to directly control low-level robotic motions, as their pretraining is based on vast internet data rather than specific robotics data. In this paper, we introduce the Lafite-RL (Language agent feedback interactive Reinforcement Learning) framework, which enables RL agents to learn robotic tasks efficiently by taking advantage of LLMs' timely feedback. Our experiments conducted on RLBench tasks illustrate that, with simple prompt design in natural language, the Lafite-RL agent exhibits improved learning capabilities when guided by an LLM. It outperforms the baseline in terms of both learning efficiency and success rate, underscoring the efficacy of the rewards provided by an LLM.

Comments:	CoRL 2023 Workshop (oral)
Subjects:	Robotics (cs.RO); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2311.02379 [cs.RO]
	(or arXiv:2311.02379v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2311.02379

Submission history

From: Kun Chu [view email]
[v1] Sat, 4 Nov 2023 11:21:38 UTC (7,785 KB)

Computer Science > Robotics

Title:Accelerating Reinforcement Learning of Robotic Manipulations via Feedback from Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Accelerating Reinforcement Learning of Robotic Manipulations via Feedback from Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators