Deep Reference Priors: What is the best way to pretrain a model?

Gao, Yansong; Ramesh, Rahul; Chaudhari, Pratik

Statistics > Machine Learning

arXiv:2202.00187 (stat)

[Submitted on 1 Feb 2022 (v1), last revised 15 Jun 2022 (this version, v2)]

Title:Deep Reference Priors: What is the best way to pretrain a model?

Authors:Yansong Gao, Rahul Ramesh, Pratik Chaudhari

View PDF

Abstract:What is the best way to exploit extra data -- be it unlabeled data from the same task, or labeled data from a related task -- to learn a given task? This paper formalizes the question using the theory of reference priors. Reference priors are objective, uninformative Bayesian priors that maximize the mutual information between the task and the weights of the model. Such priors enable the task to maximally affect the Bayesian posterior, e.g., reference priors depend upon the number of samples available for learning the task and for very small sample sizes, the prior puts more probability mass on low-complexity models in the hypothesis space. This paper presents the first demonstration of reference priors for medium-scale deep networks and image-based data. We develop generalizations of reference priors and demonstrate applications to two problems. First, by using unlabeled data to compute the reference prior, we develop new Bayesian semi-supervised learning methods that remain effective even with very few samples per class. Second, by using labeled data from the source task to compute the reference prior, we develop a new pretraining method for transfer learning that allows data from the target task to maximally affect the Bayesian posterior. Empirical validation of these methods is conducted on image classification datasets. Code is available at this https URL.

Comments:	24 pages
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2202.00187 [stat.ML]
	(or arXiv:2202.00187v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2202.00187

Submission history

From: Rahul Ramesh [view email]
[v1] Tue, 1 Feb 2022 02:32:39 UTC (8,518 KB)
[v2] Wed, 15 Jun 2022 22:06:45 UTC (8,487 KB)

Statistics > Machine Learning

Title:Deep Reference Priors: What is the best way to pretrain a model?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Deep Reference Priors: What is the best way to pretrain a model?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators