Learning on One Mode: Addressing Multi-Modality in Offline Reinforcement Learning

Wang, Mianchu; Jin, Yue; Montana, Giovanni

Computer Science > Machine Learning

arXiv:2412.03258 (cs)

[Submitted on 4 Dec 2024]

Title:Learning on One Mode: Addressing Multi-Modality in Offline Reinforcement Learning

Authors:Mianchu Wang, Yue Jin, Giovanni Montana

View PDF HTML (experimental)

Abstract:Offline reinforcement learning (RL) seeks to learn optimal policies from static datasets without interacting with the environment. A common challenge is handling multi-modal action distributions, where multiple behaviours are represented in the data. Existing methods often assume unimodal behaviour policies, leading to suboptimal performance when this assumption is violated. We propose Weighted Imitation Learning on One Mode (LOM), a novel approach that focuses on learning from a single, promising mode of the behaviour policy. By using a Gaussian mixture model to identify modes and selecting the best mode based on expected returns, LOM avoids the pitfalls of averaging over conflicting actions. Theoretically, we show that LOM improves performance while maintaining simplicity in policy learning. Empirically, LOM outperforms existing methods on standard D4RL benchmarks and demonstrates its effectiveness in complex, multi-modal scenarios.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2412.03258 [cs.LG]
	(or arXiv:2412.03258v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.03258

Submission history

From: Mianchu Wang [view email]
[v1] Wed, 4 Dec 2024 11:57:36 UTC (5,638 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2024-12

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Learning on One Mode: Addressing Multi-Modality in Offline Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning on One Mode: Addressing Multi-Modality in Offline Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators