MagMax: Leveraging Model Merging for Seamless Continual Learning

Marczak, Daniel; Twardowski, Bartłomiej; Trzciński, Tomasz; Cygert, Sebastian

Computer Science > Machine Learning

arXiv:2407.06322 (cs)

[Submitted on 8 Jul 2024 (v1), last revised 29 Jul 2024 (this version, v2)]

Title:MagMax: Leveraging Model Merging for Seamless Continual Learning

Authors:Daniel Marczak, Bartłomiej Twardowski, Tomasz Trzciński, Sebastian Cygert

View PDF HTML (experimental)

Abstract:This paper introduces a continual learning approach named MagMax, which utilizes model merging to enable large pre-trained models to continuously learn from new data without forgetting previously acquired knowledge. Distinct from traditional continual learning methods that aim to reduce forgetting during task training, MagMax combines sequential fine-tuning with a maximum magnitude weight selection for effective knowledge integration across tasks. Our initial contribution is an extensive examination of model merging techniques, revealing that simple approaches like weight averaging and random weight selection surprisingly hold up well in various continual learning contexts. More importantly, we present MagMax, a novel model-merging strategy that enables continual learning of large pre-trained models for successive tasks. Our thorough evaluation demonstrates the superiority of MagMax in various scenarios, including class- and domain-incremental learning settings. The code is available at this URL: this https URL.

Comments:	Accepted for ECCV2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.06322 [cs.LG]
	(or arXiv:2407.06322v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.06322

Submission history

From: Daniel Marczak [view email]
[v1] Mon, 8 Jul 2024 18:38:52 UTC (1,204 KB)
[v2] Mon, 29 Jul 2024 22:17:31 UTC (1,513 KB)

Computer Science > Machine Learning

Title:MagMax: Leveraging Model Merging for Seamless Continual Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:MagMax: Leveraging Model Merging for Seamless Continual Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators