Transfusion: Understanding Transfer Learning with Applications to Medical Imaging

Raghu, Maithra; Zhang, Chiyuan; Kleinberg, Jon; Bengio, Samy

Computer Science > Computer Vision and Pattern Recognition

arXiv:1902.07208v1 (cs)

[Submitted on 14 Feb 2019 (this version), latest version 29 Oct 2019 (v3)]

Title:Transfusion: Understanding Transfer Learning with Applications to Medical Imaging

Authors:Maithra Raghu, Chiyuan Zhang, Jon Kleinberg, Samy Bengio

View PDF

Abstract:With the increasingly varied applications of deep learning, transfer learning has emerged as a critically important technique. However, the central question of how much feature reuse in transfer is the source of benefit remains unanswered. In this paper, we present an in-depth analysis of the effects of transfer, focusing on medical imaging, which is a particularly intriguing setting. Here, transfer learning is extremely popular, but data differences between pretraining and finetuing are considerable, reiterating the question of what is transferred. With experiments on two large scale medical imaging datasets, and CIFAR-10, we find transfer has almost negligible effects on performance, but significantly helps convergence speed. However, in all of these settings, convergence without transfer can be sped up dramatically by using only mean and variance statistics of the pretrained weights. Visualizing the lower layer filters shows that models trained from random initialization do not learn Gabor filters on medical images. We use CCA (canonical correlation analysis) to study the learned representations of the different models, finding that pretrained models are surprisingly similar to random initialization at higher layers. This similarity is evidenced both through model learning dynamics and a transfusion experiment, which explores the convergence speed using a subset of pretrained weights.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1902.07208 [cs.CV]
	(or arXiv:1902.07208v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1902.07208

Submission history

From: Chiyuan Zhang [view email]
[v1] Thu, 14 Feb 2019 01:54:53 UTC (4,207 KB)
[v2] Thu, 30 May 2019 00:37:48 UTC (8,278 KB)
[v3] Tue, 29 Oct 2019 20:16:30 UTC (4,397 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Transfusion: Understanding Transfer Learning with Applications to Medical Imaging

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Transfusion: Understanding Transfer Learning with Applications to Medical Imaging

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators