Continual Learning using a Bayesian Nonparametric Dictionary of Weight Factors

Mehta, Nikhil; Liang, Kevin J; Verma, Vinay K; Carin, Lawrence

arXiv:2004.10098 (cs)

[Submitted on 21 Apr 2020 (v1), last revised 27 Apr 2021 (this version, v3)]

Abstract:Naively trained neural networks tend to experience catastrophic forgetting in sequential task settings, where data from previous tasks are unavailable. A number of methods, using various model expansion strategies, have been proposed recently as possible solutions. However, determining how much to expand the model is left to the practitioner, and often a constant schedule is chosen for simplicity, regardless of how complex the incoming task is. Instead, we propose a principled Bayesian nonparametric approach based on the Indian Buffet Process (IBP) prior, letting the data determine how much to expand the model complexity. We pair this with a factorization of the neural network's weight matrices. Such an approach allows the number of factors of each weight matrix to scale with the complexity of the task, while the IBP prior encourages sparse weight factor selection and factor reuse, promoting positive knowledge transfer between tasks. We demonstrate the effectiveness of our method on a number of continual learning benchmarks and analyze how weight factors are allocated and reused throughout the training.
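The two ingredients named in the abstract, an IBP prior over factor selection and a factorized weight matrix, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `sample_ibp_mask` and `task_weight`, the truncation level `max_factors`, the concentration `alpha`, and the specific form W = A diag(z) B are assumptions chosen for illustration. The sketch uses the standard truncated stick-breaking construction of the IBP and only samples from the prior; it performs no inference or training.

```python
import numpy as np


def sample_ibp_mask(num_tasks, max_factors, alpha, rng):
    """Draw binary factor-selection masks from a truncated stick-breaking IBP."""
    # Stick-breaking construction: pi_k = prod_{j<=k} nu_j with nu_j ~ Beta(alpha, 1),
    # so the activation probabilities are non-increasing in k.
    nu = rng.beta(alpha, 1.0, size=max_factors)
    pi = np.cumprod(nu)
    # Each task independently switches factor k on with probability pi_k.
    masks = rng.random((num_tasks, max_factors)) < pi
    return masks.astype(np.float64), pi


def task_weight(left, right, mask):
    """Build a task-specific weight matrix as a masked sum of rank-1 factors."""
    # Equivalent to left @ diag(mask) @ right; inactive factors contribute nothing.
    return (left * mask) @ right


# Hypothetical sizes: 3 tasks, a shared dictionary of 8 factors, a 4x5 layer.
rng = np.random.default_rng(0)
masks, pi = sample_ibp_mask(num_tasks=3, max_factors=8, alpha=2.0, rng=rng)
left = rng.standard_normal((4, 8))   # shared left factors (columns)
right = rng.standard_normal((8, 5))  # shared right factors (rows)
W0 = task_weight(left, right, masks[0])  # weight matrix used by the first task
```

Because every task draws its mask over the same shared factors, tasks whose masks overlap reuse those factors, while the decaying probabilities `pi` keep the number of active factors per task small. A larger `alpha` keeps `pi` high for more factors, so each task tends to activate more of the dictionary.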

Comments:	Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS) 2021. Post-conference updates: fixed typo in equation (11) and updated references.
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2004.10098 [cs.LG]
	(or arXiv:2004.10098v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2004.10098

Submission history

From: Nikhil Mehta
[v1] Tue, 21 Apr 2020 15:20:19 UTC (962 KB)
[v2] Fri, 5 Mar 2021 04:08:52 UTC (1,820 KB)
[v3] Tue, 27 Apr 2021 23:28:11 UTC (3,869 KB)
