Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces

Ota, Toshihiro

arXiv:2403.19925 (cs)

[Submitted on 29 Mar 2024]

Abstract: Decision Transformer, a promising approach that applies Transformer architectures to reinforcement learning, relies on causal self-attention to model sequences of states, actions, and rewards. While this method has shown competitive results, this paper investigates the integration of the Mamba framework, known for its advanced capabilities in efficient and effective sequence modeling, into the Decision Transformer architecture, focusing on the potential performance enhancements in sequential decision-making tasks. Our study systematically evaluates this integration by conducting a series of experiments across various decision-making environments, comparing the modified Decision Transformer, Decision Mamba, with its traditional counterpart. This work contributes to the advancement of sequential decision-making models, suggesting that the architecture and training methodology of neural networks can significantly impact their performance in complex tasks, and highlighting the potential of Mamba as a valuable tool for improving the efficacy of Transformer-based models in reinforcement learning scenarios.
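To make the idea in the abstract concrete, here is a minimal, illustrative sketch (not the paper's implementation) of swapping a causal self-attention token mixer for a selective state-space recurrence over a trajectory. It assumes the Decision Transformer convention of interleaving return, state, and action embeddings per timestep. The names `interleave_tokens` and `selective_scan` and the projections `W_delta`, `W_B`, and `W_C` are hypothetical, and the recurrence is a simplified sequential form of a selective scan rather than Mamba's actual kernel.

```python
import math
import random


def interleave_tokens(returns, states, actions):
    """Order per-timestep embeddings as R_1, s_1, a_1, R_2, s_2, a_2, ..."""
    tokens = []
    for r, s, a in zip(returns, states, actions):
        tokens.extend([r, s, a])
    return tokens


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def softplus(z):
    # Numerically stable log(1 + exp(z)); keeps every step size positive.
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def selective_scan(x, A, W_delta, W_B, W_C):
    """Toy selective state-space recurrence; causal by construction.

    x: list of L tokens, each a list of d floats.
    A: d x n nested list of negative decay rates (one diagonal state per channel).
    W_delta (d x d), W_B (n x d), W_C (n x d): hypothetical projections that make
    the step size, input weights, and output weights depend on the current token.
    """
    d, n = len(A), len(A[0])
    h = [[0.0] * n for _ in range(d)]  # hidden state, carried left to right
    outputs = []
    for x_t in x:
        delta = [softplus(dot(row, x_t)) for row in W_delta]  # per-channel step
        B = [dot(row, x_t) for row in W_B]  # input-dependent input weights
        C = [dot(row, x_t) for row in W_C]  # input-dependent output weights
        y_t = []
        for i in range(d):
            for j in range(n):
                decay = math.exp(delta[i] * A[i][j])  # in (0, 1) because A < 0
                h[i][j] = decay * h[i][j] + delta[i] * B[j] * x_t[i]
            y_t.append(dot(h[i], C))
        outputs.append(y_t)
    return outputs


# Usage on random stand-in embeddings (all sizes here are arbitrary assumptions).
rng = random.Random(0)


def rand_matrix(rows, cols, scale):
    return [[rng.gauss(0.0, scale) for _ in range(cols)] for _ in range(rows)]


T, d, n = 4, 3, 2
returns, states, actions = (rand_matrix(T, d, 1.0) for _ in range(3))
tokens = interleave_tokens(returns, states, actions)  # length 3 * T
A = [[-math.exp(rng.gauss(0.0, 1.0)) for _ in range(n)] for _ in range(d)]
W_delta, W_B, W_C = rand_matrix(d, d, 0.1), rand_matrix(n, d, 0.1), rand_matrix(n, d, 0.1)
y = selective_scan(tokens, A, W_delta, W_B, W_C)
```

Because the hidden state is updated strictly left to right, the output at each position depends only on earlier tokens. That is the same causality property the abstract attributes to causal self-attention, here obtained without an attention mask.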

Comments:	8 pages, 1 figure
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Report number:	RIKEN-iTHEMS-Report-24
Cite as:	arXiv:2403.19925 [cs.LG]
	(or arXiv:2403.19925v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.19925

Submission history

From: Toshihiro Ota
[v1] Fri, 29 Mar 2024 02:25:55 UTC (134 KB)
