Measuring Catastrophic Forgetting in Neural Networks

Kemker, Ronald; McClure, Marc; Abitino, Angelina; Hayes, Tyler; Kanan, Christopher

Computer Science > Artificial Intelligence

arXiv:1708.02072 (cs)

[Submitted on 7 Aug 2017 (v1), last revised 9 Nov 2017 (this version, v4)]

Title:Measuring Catastrophic Forgetting in Neural Networks

Authors:Ronald Kemker, Marc McClure, Angelina Abitino, Tyler Hayes, Christopher Kanan

View PDF

Abstract:Deep neural networks are used in many state-of-the-art systems for machine perception. Once a network is trained to do a specific task, e.g., bird classification, it cannot easily be trained to do new tasks, e.g., incrementally learning to recognize additional bird species or learning an entirely different task such as flower recognition. When new tasks are added, typical deep neural networks are prone to catastrophically forgetting previous tasks. Networks that are capable of assimilating new information incrementally, much like how humans form new memories over time, will be more efficient than re-training the model from scratch each time a new task needs to be learned. There have been multiple attempts to develop schemes that mitigate catastrophic forgetting, but these methods have not been directly compared, the tests used to evaluate them vary considerably, and these methods have only been evaluated on small-scale problems (e.g., MNIST). In this paper, we introduce new metrics and benchmarks for directly comparing five different mechanisms designed to mitigate catastrophic forgetting in neural networks: regularization, ensembling, rehearsal, dual-memory, and sparse-coding. Our experiments on real-world images and sounds show that the mechanism(s) that are critical for optimal performance vary based on the incremental training paradigm and type of data being used, but they all demonstrate that the catastrophic forgetting problem has yet to be solved.

Comments:	To appear in AAAI 2018
Subjects:	Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:1708.02072 [cs.AI]
	(or arXiv:1708.02072v4 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1708.02072

Submission history

From: Ronald Kemker [view email]
[v1] Mon, 7 Aug 2017 11:18:43 UTC (213 KB)
[v2] Tue, 8 Aug 2017 09:33:24 UTC (218 KB)
[v3] Mon, 11 Sep 2017 16:50:39 UTC (1,179 KB)
[v4] Thu, 9 Nov 2017 14:53:07 UTC (1,225 KB)

Computer Science > Artificial Intelligence

Title:Measuring Catastrophic Forgetting in Neural Networks

Submission history

Access Paper:

References & Citations

3 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Measuring Catastrophic Forgetting in Neural Networks

Submission history

Access Paper:

References & Citations

3 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators