BECAME: BayEsian Continual Learning with Adaptive Model MErging

Li, Mei; Lu, Yuxiang; Dai, Qinyan; Huang, Suizhi; Ding, Yue; Lu, Hongtao

Computer Science > Machine Learning

arXiv:2504.02666 (cs)

[Submitted on 3 Apr 2025]

Title:BECAME: BayEsian Continual Learning with Adaptive Model MErging

Authors:Mei Li, Yuxiang Lu, Qinyan Dai, Suizhi Huang, Yue Ding, Hongtao Lu

View PDF HTML (experimental)

Abstract:Continual Learning (CL) strives to learn incrementally across tasks while mitigating catastrophic forgetting. A key challenge in CL is balancing stability (retaining prior knowledge) and plasticity (learning new tasks). While representative gradient projection methods ensure stability, they often limit plasticity. Model merging techniques offer promising solutions, but prior methods typically rely on empirical assumptions and carefully selected hyperparameters. In this paper, we explore the potential of model merging to enhance the stability-plasticity trade-off, providing theoretical insights that underscore its benefits. Specifically, we reformulate the merging mechanism using Bayesian continual learning principles and derive a closed-form solution for the optimal merging coefficient that adapts to the diverse characteristics of tasks. To validate our approach, we introduce a two-stage framework named BECAME, which synergizes the expertise of gradient projection and adaptive merging. Extensive experiments show that our approach outperforms state-of-the-art CL methods and existing merging strategies.

Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2504.02666 [cs.LG]
	(or arXiv:2504.02666v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2504.02666

Submission history

From: Mei Li [view email]
[v1] Thu, 3 Apr 2025 15:07:28 UTC (3,239 KB)

Computer Science > Machine Learning

Title:BECAME: BayEsian Continual Learning with Adaptive Model MErging

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:BECAME: BayEsian Continual Learning with Adaptive Model MErging

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators