Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

Abramson, Josh; Ahuja, Arun; Carnevale, Federico; Georgiev, Petko; Goldin, Alex; Hung, Alden; Landon, Jessica; Lhotka, Jirka; Lillicrap, Timothy; Muldal, Alistair; Powell, George; Santoro, Adam; Scully, Guy; Srivastava, Sanjana; von Glehn, Tamara; Wayne, Greg; Wong, Nathaniel; Yan, Chen; Zhu, Rui

Computer Science > Machine Learning

arXiv:2211.11602 (cs)

[Submitted on 21 Nov 2022]

Title:Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

Authors:Josh Abramson, Arun Ahuja, Federico Carnevale, Petko Georgiev, Alex Goldin, Alden Hung, Jessica Landon, Jirka Lhotka, Timothy Lillicrap, Alistair Muldal, George Powell, Adam Santoro, Guy Scully, Sanjana Srivastava, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu

View PDF

Abstract:An important goal in artificial intelligence is to create agents that can both interact naturally with humans and learn from their feedback. Here we demonstrate how to use reinforcement learning from human feedback (RLHF) to improve upon simulated, embodied agents trained to a base level of competency with imitation learning. First, we collected data of humans interacting with agents in a simulated 3D world. We then asked annotators to record moments where they believed that agents either progressed toward or regressed from their human-instructed goal. Using this annotation data we leveraged a novel method - which we call "Inter-temporal Bradley-Terry" (IBT) modelling - to build a reward model that captures human judgments. Agents trained to optimise rewards delivered from IBT reward models improved with respect to all of our metrics, including subsequent human judgment during live interactions with agents. Altogether our results demonstrate how one can successfully leverage human judgments to improve agent behaviour, allowing us to use reinforcement learning in complex, embodied domains without programmatic reward functions. Videos of agent behaviour may be found at this https URL.

Subjects:	Machine Learning (cs.LG); Human-Computer Interaction (cs.HC); Multiagent Systems (cs.MA)
Cite as:	arXiv:2211.11602 [cs.LG]
	(or arXiv:2211.11602v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2211.11602

Submission history

From: Federico Carnevale [view email]
[v1] Mon, 21 Nov 2022 16:00:31 UTC (31,149 KB)

Computer Science > Machine Learning

Title:Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

Submission history

Access Paper:

References & Citations

1 blog link

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

Submission history

Access Paper:

References & Citations

1 blog link

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators