AdaCred: Adaptive Causal Decision Transformers with Feature Crediting

Kumawat, Hemant; Mukhopadhyay, Saibal

arXiv:2412.15427 (cs)

[Submitted on 19 Dec 2024]

Abstract: Reinforcement learning (RL) can be formulated as a sequence modeling problem, where models predict future actions based on historical state-action-reward sequences. Current approaches typically require long trajectory sequences to model the environment in offline RL settings. However, these models tend to over-rely on memorizing long-term representations, which impairs their ability to effectively attribute importance to trajectories and learned representations based on task-specific relevance. In this work, we introduce AdaCred, a novel approach that represents trajectories as causal graphs built from short-term action-reward-state sequences. Our model adaptively learns a control policy by crediting and pruning low-importance representations, retaining only those most relevant for the downstream task. Our experiments demonstrate that AdaCred-based policies require shorter trajectory sequences and consistently outperform conventional methods in both offline reinforcement learning and imitation learning environments.

Comments:	Accepted to 24th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2025)
Subjects:	Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2412.15427 [cs.LG]
	(or arXiv:2412.15427v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.15427

Submission history

From: Hemant Kumawat
[v1] Thu, 19 Dec 2024 22:22:37 UTC (11,308 KB)
