Linked Adapters: Linking Past and Future to Present for Effective Continual Learning

Chandra, Dupati Srikar; Srijith, P. K.; Rezazadegan, Dana; McCarthy, Chris

arXiv:2412.10687 (cs)

[Submitted on 14 Dec 2024]

Abstract: Continual learning allows a system to learn and adapt to new tasks while retaining the knowledge acquired from previous tasks. However, deep learning models suffer from catastrophic forgetting of knowledge learned from earlier tasks while learning a new task. Moreover, retraining large models like transformers from scratch for every new task is costly. An effective approach to continual learning is to use a large pre-trained model with task-specific adapters to adapt to new tasks. Although this approach can mitigate catastrophic forgetting, it fails to transfer knowledge across tasks because each task learns its adapters separately. To address this, we propose a novel approach, Linked Adapters, that allows knowledge transfer through a weighted attention mechanism to other task-specific adapters. Linked Adapters use a multi-layer perceptron (MLP) to model the attention weights, which overcomes the challenge of backward knowledge transfer in continual learning in addition to modeling forward knowledge transfer. During inference, the proposed approach leverages knowledge transfer through MLP-based attention weights across all the lateral task adapters. Through experiments on diverse image classification datasets, we demonstrate improved performance on continual learning tasks using Linked Adapters.
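
The linking idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the adapter architecture, the inputs to the MLP, or how weights are normalized. The sketch assumes bottleneck adapters, one embedding vector per task, and an MLP that maps a (target task, source task) embedding pair to a scalar weight in (0, 1). All names (`Adapter`, `LinkWeightMLP`, `linked_adapter_output`) and dimensions are hypothetical.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def rand_matrix(rows, cols, rng, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


class Adapter:
    """Assumed bottleneck adapter: down-project, ReLU, up-project."""

    def __init__(self, dim, bottleneck, rng):
        self.down = rand_matrix(bottleneck, dim, rng)
        self.up = rand_matrix(dim, bottleneck, rng)

    def __call__(self, h):
        z = [max(0.0, v) for v in matvec(self.down, h)]
        return matvec(self.up, z)


class LinkWeightMLP:
    """Assumed MLP: maps a pair of task embeddings to a weight in (0, 1)."""

    def __init__(self, emb_dim, hidden, rng):
        self.w1 = rand_matrix(hidden, 2 * emb_dim, rng)
        self.w2 = rand_matrix(1, hidden, rng)

    def __call__(self, e_target, e_source):
        z = [math.tanh(v) for v in matvec(self.w1, e_target + e_source)]
        return 1.0 / (1.0 + math.exp(-matvec(self.w2, z)[0]))


def linked_adapter_output(h, task, adapters, embeddings, link_mlp):
    """Residual + own adapter + MLP-weighted outputs of all other task adapters."""
    own = adapters[task](h)
    out = [hi + oi for hi, oi in zip(h, own)]
    for other, adapter in enumerate(adapters):
        if other == task:
            continue
        weight = link_mlp(embeddings[task], embeddings[other])
        out = [o + weight * l for o, l in zip(out, adapter(h))]
    return out


rng = random.Random(0)
dim, bottleneck, emb_dim, num_tasks = 8, 2, 4, 3
adapters = [Adapter(dim, bottleneck, rng) for _ in range(num_tasks)]
embeddings = [[rng.uniform(-1, 1) for _ in range(emb_dim)] for _ in range(num_tasks)]
link_mlp = LinkWeightMLP(emb_dim, 6, rng)

h = [rng.uniform(-1, 1) for _ in range(dim)]  # stand-in for a frozen backbone feature
# Task 1 draws on an earlier adapter (task 0) and a later one (task 2).
y = linked_adapter_output(h, task=1, adapters=adapters,
                          embeddings=embeddings, link_mlp=link_mlp)
```

In this sketch the link weights are computed by a function of task embeddings rather than stored per pair, so a weight toward an adapter added after task 1 was trained can still be produced at inference time. That is one plausible reading of how an MLP over attention weights could support backward transfer; the paper itself should be consulted for the actual formulation.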

Comments:	13 Pages, 5 Figures
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2412.10687 [cs.LG]
	(or arXiv:2412.10687v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.10687

Submission history

From: Srikar Chandra Dupati
[v1] Sat, 14 Dec 2024 05:25:17 UTC (10,254 KB)
