Real-Time Recurrent Reinforcement Learning

Lemmel, Julian; Grosu, Radu

Computer Science > Machine Learning

arXiv:2311.04830 (cs)

[Submitted on 8 Nov 2023 (v1), last revised 13 Mar 2025 (this version, v3)]

Title:Real-Time Recurrent Reinforcement Learning

Authors:Julian Lemmel, Radu Grosu

View PDF HTML (experimental)

Abstract:We introduce a biologically plausible RL framework for solving tasks in partially observable Markov decision processes (POMDPs). The proposed algorithm combines three integral parts: (1) A Meta-RL architecture, resembling the mammalian basal ganglia; (2) A biologically plausible reinforcement learning algorithm, exploiting temporal difference learning and eligibility traces to train the policy and the value-function; (3) An online automatic differentiation algorithm for computing the gradients with respect to parameters of a shared recurrent network backbone. Our experimental results show that the method is capable of solving a diverse set of partially observable reinforcement learning tasks. The algorithm we call real-time recurrent reinforcement learning (RTRRL) serves as a model of learning in biological neural networks, mimicking reward pathways in the basal ganglia.

Comments:	14 pages, 9 figures, includes Appendix
Subjects:	Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE); Systems and Control (eess.SY)
Cite as:	arXiv:2311.04830 [cs.LG]
	(or arXiv:2311.04830v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2311.04830

Submission history

From: Julian Lemmel [view email]
[v1] Wed, 8 Nov 2023 16:56:16 UTC (3,934 KB)
[v2] Thu, 28 Mar 2024 10:30:57 UTC (4,065 KB)
[v3] Thu, 13 Mar 2025 10:19:32 UTC (1,085 KB)

Computer Science > Machine Learning

Title:Real-Time Recurrent Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Real-Time Recurrent Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators