Born Again Neural Networks

Furlanello, Tommaso; Lipton, Zachary C.; Tschannen, Michael; Itti, Laurent; Anandkumar, Anima

arXiv:1805.04770 (stat)

[Submitted on 12 May 2018 (v1), last revised 29 Jun 2018 (this version, v2)]

Abstract: Knowledge Distillation (KD) consists of transferring “knowledge” from one machine learning model (the teacher) to another (the student). Commonly, the teacher is a high-capacity model with formidable performance, while the student is more compact. By transferring knowledge, one hopes to benefit from the student’s compactness, without sacrificing too much performance. We study KD from a new perspective: rather than compressing models, we train students parameterized identically to their teachers. Surprisingly, these Born-Again Networks (BANs) outperform their teachers significantly, both on computer vision and language modeling tasks. Our experiments with BANs based on DenseNets demonstrate state-of-the-art performance on the CIFAR-10 (3.5%) and CIFAR-100 (15.5%) datasets, by validation error. Additional experiments explore two distillation objectives: (i) Confidence-Weighted by Teacher Max (CWTM) and (ii) Dark Knowledge with Permuted Predictions (DKPP). Both methods elucidate the essential components of KD, demonstrating the effect of the teacher outputs on both predicted and non-predicted classes.
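The distillation setup the abstract describes can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (softmax, distillation_loss, permute_non_argmax) are hypothetical, the loss is assumed to be a plain cross-entropy between the teacher's and the student's output distributions, and the permutation helper is only one reading of what "permuted predictions" in DKPP could mean (shuffling the teacher's probabilities for the non-predicted classes while leaving the predicted class in place). In the born-again setting, the two logit vectors would come from two networks with identical architecture.

```python
import math
import random


def softmax(logits):
    """Turn a list of logits into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits):
    """Cross-entropy of the student's distribution against the teacher's.

    Assumption: the student is trained to match the teacher's full output
    distribution rather than only the hard label.
    """
    p_teacher = softmax(teacher_logits)
    p_student = softmax(student_logits)
    return -sum(t * math.log(s) for t, s in zip(p_teacher, p_student))


def permute_non_argmax(probs, rng):
    """Shuffle every probability except the largest one (hypothetical helper).

    The predicted class keeps its position and value; the remaining
    probabilities are reassigned at random among the other classes.
    """
    top = max(range(len(probs)), key=probs.__getitem__)
    rest = [p for i, p in enumerate(probs) if i != top]
    rng.shuffle(rest)
    return rest[:top] + [probs[top]] + rest[top:]


if __name__ == "__main__":
    teacher_logits = [2.0, 0.5, -1.0]
    # A student that reproduces the teacher's logits reaches the minimum loss.
    print(distillation_loss(teacher_logits, teacher_logits))
    print(distillation_loss([0.0, 0.0, 0.0], teacher_logits))
    print(permute_non_argmax(softmax(teacher_logits), random.Random(0)))
```

The loss is smallest when the student's distribution equals the teacher's, which is why a student with the same capacity as its teacher can in principle match it exactly; the abstract's claim is that such students end up outperforming their teachers.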

Comments:	Published @ICML 2018
Subjects:	Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:1805.04770 [stat.ML]
	(or arXiv:1805.04770v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1805.04770

Submission history

From: Tommaso Furlanello
[v1] Sat, 12 May 2018 19:48:50 UTC (586 KB)
[v2] Fri, 29 Jun 2018 10:46:28 UTC (586 KB)
