Distilling Causal Effect of Data in Class-Incremental Learning

Hu, Xinting; Tang, Kaihua; Miao, Chunyan; Hua, Xian-Sheng; Zhang, Hanwang

Computer Science > Artificial Intelligence

arXiv:2103.01737 (cs)

[Submitted on 2 Mar 2021 (v1), last revised 8 Mar 2021 (this version, v3)]

Title:Distilling Causal Effect of Data in Class-Incremental Learning

Authors:Xinting Hu, Kaihua Tang, Chunyan Miao, Xian-Sheng Hua, Hanwang Zhang

View PDF

Abstract:We propose a causal framework to explain the catastrophic forgetting in Class-Incremental Learning (CIL) and then derive a novel distillation method that is orthogonal to the existing anti-forgetting techniques, such as data replay and feature/label distillation. We first 1) place CIL into the framework, 2) answer why the forgetting happens: the causal effect of the old data is lost in new training, and then 3) explain how the existing techniques mitigate it: they bring the causal effect back. Based on the framework, we find that although the feature/label distillation is storage-efficient, its causal effect is not coherent with the end-to-end feature learning merit, which is however preserved by data replay. To this end, we propose to distill the Colliding Effect between the old and the new data, which is fundamentally equivalent to the causal effect of data replay, but without any cost of replay storage. Thanks to the causal effect analysis, we can further capture the Incremental Momentum Effect of the data stream, removing which can help to retain the old effect overwhelmed by the new data effect, and thus alleviate the forgetting of the old class in testing. Extensive experiments on three CIL benchmarks: CIFAR-100, ImageNet-Sub&Full, show that the proposed causal effect distillation can improve various state-of-the-art CIL methods by a large margin (0.72%--9.06%).

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2103.01737 [cs.AI]
	(or arXiv:2103.01737v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2103.01737

Submission history

From: Xinting Hu [view email]
[v1] Tue, 2 Mar 2021 14:14:10 UTC (7,241 KB)
[v2] Thu, 4 Mar 2021 08:37:50 UTC (7,241 KB)
[v3] Mon, 8 Mar 2021 03:16:37 UTC (7,241 KB)

Computer Science > Artificial Intelligence

Title:Distilling Causal Effect of Data in Class-Incremental Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Distilling Causal Effect of Data in Class-Incremental Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators