A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds

Cui, Christopher Z.; Peng, Xiangyu; Riedl, Mark O.

Computer Science > Computation and Language

arXiv:2405.06059 (cs)

[Submitted on 9 May 2024]

Title:A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds

Authors:Christopher Z. Cui, Xiangyu Peng, Mark O. Riedl

View PDF HTML (experimental)

Abstract:Open-ended worlds are those in which there are no pre-specified goals or environmental reward signal. As a consequence, an agent must know how to perform a multitude of tasks. However, when a new task is presented to an agent, we expect it to be able to reuse some of what it knows from previous tasks to rapidly learn that new task. We introduce a novel technique whereby policies for different a priori known tasks are combined into a Mixture-of-Experts model with an attention mechanism across a mix of frozen and unfrozen experts. The model learns when to attend to frozen task-specific experts when appropriate and learns new experts to handle novel situations. We work in an open-ended text-based environment in which the agent is tasked with behaving like different types of character roles and must rapidly learn behaviors associated with new character role types. We show that our agent both obtains more rewards in the zero-shot setting, and discovers these rewards with greater sample efficiency in the few-shot learning settings.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2405.06059 [cs.CL]
	(or arXiv:2405.06059v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2405.06059

Submission history

From: Christopher Cui [view email]
[v1] Thu, 9 May 2024 19:02:56 UTC (3,665 KB)

Computer Science > Computation and Language

Title:A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators