Learning Memory Mechanisms for Decision Making through Demonstrations

Yue, William; Liu, Bo; Stone, Peter

Computer Science > Machine Learning

arXiv:2411.07954 (cs)

[Submitted on 12 Nov 2024 (v1), last revised 13 Nov 2024 (this version, v2)]

Title:Learning Memory Mechanisms for Decision Making through Demonstrations

Authors:William Yue, Bo Liu, Peter Stone

View PDF HTML (experimental)

Abstract:In Partially Observable Markov Decision Processes, integrating an agent's history into memory poses a significant challenge for decision-making. Traditional imitation learning, relying on observation-action pairs for expert demonstrations, fails to capture the expert's memory mechanisms used in decision-making. To capture memory processes as demonstrations, we introduce the concept of memory dependency pairs $(p, q)$ indicating that events at time $p$ are recalled for decision-making at time $q$. We introduce AttentionTuner to leverage memory dependency pairs in Transformers and find significant improvements across several tasks compared to standard Transformers when evaluated on Memory Gym and the Long-term Memory Benchmark. Code is available at this https URL.

Subjects:	Machine Learning (cs.LG); Robotics (cs.RO)
Cite as:	arXiv:2411.07954 [cs.LG]
	(or arXiv:2411.07954v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.07954

Submission history

From: William Yue [view email]
[v1] Tue, 12 Nov 2024 17:30:31 UTC (11,067 KB)
[v2] Wed, 13 Nov 2024 02:56:56 UTC (11,067 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2024-11

Change to browse by:

cs
cs.RO

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Learning Memory Mechanisms for Decision Making through Demonstrations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning Memory Mechanisms for Decision Making through Demonstrations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators