Continual Learning With Quasi-Newton Methods

Eeckt, Steven Vander; Van hamme, Hugo

doi:10.1109/ACCESS.2025.3551146

Computer Science > Machine Learning

arXiv:2503.19939 (cs)

[Submitted on 25 Mar 2025]

Title:Continual Learning With Quasi-Newton Methods

Authors:Steven Vander Eeckt, Hugo Van hamme

View PDF HTML (experimental)

Abstract:Catastrophic forgetting remains a major challenge when neural networks learn tasks sequentially. Elastic Weight Consolidation (EWC) attempts to address this problem by introducing a Bayesian-inspired regularization loss to preserve knowledge of previously learned tasks. However, EWC relies on a Laplace approximation where the Hessian is simplified to the diagonal of the Fisher information matrix, assuming uncorrelated model parameters. This overly simplistic assumption often leads to poor Hessian estimates, limiting its effectiveness. To overcome this limitation, we introduce Continual Learning with Sampled Quasi-Newton (CSQN), which leverages Quasi-Newton methods to compute more accurate Hessian approximations. CSQN captures parameter interactions beyond the diagonal without requiring architecture-specific modifications, making it applicable across diverse tasks and architectures. Experimental results across four benchmarks demonstrate that CSQN consistently outperforms EWC and other state-of-the-art baselines, including rehearsal-based methods. CSQN reduces EWC's forgetting by 50 percent and improves its performance by 8 percent on average. Notably, CSQN achieves superior results on three out of four benchmarks, including the most challenging scenarios, highlighting its potential as a robust solution for continual learning.

Comments:	Published in IEEE Access
Subjects:	Machine Learning (cs.LG); Image and Video Processing (eess.IV)
Cite as:	arXiv:2503.19939 [cs.LG]
	(or arXiv:2503.19939v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2503.19939
Journal reference:	IEEE Access, vol. 13, pp. 47485-47499, 2025
Related DOI:	https://doi.org/10.1109/ACCESS.2025.3551146

Submission history

From: Steven Vander Eeckt [view email]
[v1] Tue, 25 Mar 2025 07:45:59 UTC (3,003 KB)

Computer Science > Machine Learning

Title:Continual Learning With Quasi-Newton Methods

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Continual Learning With Quasi-Newton Methods

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators