Decision Mamba Architectures

Correia, André; Alexandre, Luís A.

Computer Science > Machine Learning

arXiv:2405.07943 (cs)

[Submitted on 13 May 2024 (v1), last revised 17 Oct 2024 (this version, v2)]

Title:Decision Mamba Architectures

Authors:André Correia, Luís A. Alexandre

View PDF HTML (experimental)

Abstract:Recent advancements in imitation learning have been largely fueled by the integration of sequence models, which provide a structured flow of information to effectively mimic task behaviours. Currently, Decision Transformer (DT) and subsequently, the Hierarchical Decision Transformer (HDT), presented Transformer-based approaches to learn task policies. Recently, the Mamba architecture has shown to outperform Transformers across various task domains. In this work, we introduce two novel methods, Decision Mamba (DM) and Hierarchical Decision Mamba (HDM), aimed at enhancing the performance of the Transformer models. Through extensive experimentation across diverse environments such as OpenAI Gym and D4RL, leveraging varying demonstration data sets, we demonstrate the superiority of Mamba models over their Transformer counterparts in a majority of tasks. Results show that DM outperforms other methods in most settings. The code can be found at this https URL.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2405.07943 [cs.LG]
	(or arXiv:2405.07943v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.07943

Submission history

From: André Correia [view email]
[v1] Mon, 13 May 2024 17:18:08 UTC (86 KB)
[v2] Thu, 17 Oct 2024 09:48:06 UTC (114 KB)

Computer Science > Machine Learning

Title:Decision Mamba Architectures

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Decision Mamba Architectures

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators