Parseval Regularization for Continual Reinforcement Learning

Chung, Wesley; Cherif, Lynn; Meger, David; Precup, Doina

arXiv:2412.07224 (cs)

[Submitted on 10 Dec 2024]

Abstract: Loss of plasticity, trainability loss, and primacy bias have been identified as issues arising when training deep neural networks on sequences of tasks; all three refer to the increased difficulty of training on new tasks. We propose to use Parseval regularization, which maintains orthogonality of weight matrices, to preserve useful optimization properties and improve training in a continual reinforcement learning setting. We show that it provides significant benefits to RL agents on a suite of gridworld, CARL and MetaWorld tasks. We conduct comprehensive ablations to identify the source of its benefits and investigate the effect of certain metrics associated with network trainability, including weight matrix rank, weight norms and policy entropy.
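
To make "maintains orthogonality of weight matrices" concrete, here is a minimal sketch of an orthogonality penalty of the kind commonly associated with Parseval regularization: the squared Frobenius norm of W W^T - s I, which would be added to the training loss with some coefficient. The abstract does not state the exact form the authors use, so the function names, the `scale` parameter (s) and the pure-Python matrix representation are assumptions made for illustration only.

```python
# Illustrative sketch only: the function names and the `scale` parameter are
# assumptions for this example, not taken from the paper.

def gram_matrix(w):
    """Return W @ W^T for a matrix W given as a list of rows."""
    return [[sum(a * b for a, b in zip(row_i, row_j)) for row_j in w]
            for row_i in w]


def parseval_penalty(w, scale=1.0):
    """Squared Frobenius norm of (W W^T - scale * I).

    The penalty is zero exactly when the rows of W are mutually orthogonal
    and each row has squared norm equal to `scale`.
    """
    gram = gram_matrix(w)
    total = 0.0
    for i, row in enumerate(gram):
        for j, value in enumerate(row):
            target = scale if i == j else 0.0
            total += (value - target) ** 2
    return total


# Rows of the identity are orthonormal, so the penalty vanishes.
orthogonal = [[1.0, 0.0], [0.0, 1.0]]
# Two identical rows are maximally correlated, so the penalty is positive.
correlated = [[1.0, 0.0], [1.0, 0.0]]

penalty_orthogonal = parseval_penalty(orthogonal)
penalty_correlated = parseval_penalty(correlated)
```

In a training loop, a term like this would typically be summed over the layers' weight matrices and scaled by a regularization coefficient before being added to the agent's loss; how the coefficient and scale are chosen is left to the paper.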

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2412.07224 [cs.LG]
	(or arXiv:2412.07224v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.07224

Submission history

From: Wesley Chung
[v1] Tue, 10 Dec 2024 06:19:21 UTC (11,233 KB)
