Improving Pre-Trained Self-Supervised Embeddings Through Effective Entropy Maximization

Chakraborty, Deep; LeCun, Yann; Rudner, Tim G. J.; Learned-Miller, Erik

Computer Science > Machine Learning

arXiv:2411.15931 (cs)

[Submitted on 24 Nov 2024]

Title:Improving Pre-Trained Self-Supervised Embeddings Through Effective Entropy Maximization

Authors:Deep Chakraborty, Yann LeCun, Tim G. J. Rudner, Erik Learned-Miller

View PDF

Abstract:A number of different architectures and loss functions have been applied to the problem of self-supervised learning (SSL), with the goal of developing embeddings that provide the best possible pre-training for as-yet-unknown, lightly supervised downstream tasks. One of these SSL criteria is to maximize the entropy of a set of embeddings in some compact space. But the goal of maximizing the embedding entropy often depends--whether explicitly or implicitly--upon high dimensional entropy estimates, which typically perform poorly in more than a few dimensions. In this paper, we motivate an effective entropy maximization criterion (E2MC), defined in terms of easy-to-estimate, low-dimensional constraints. We demonstrate that using it to continue training an already-trained SSL model for only a handful of epochs leads to a consistent and, in some cases, significant improvement in downstream performance. We perform careful ablation studies to show that the improved performance is due to the proposed add-on criterion. We also show that continued pre-training with alternative criteria does not lead to notable improvements, and in some cases, even degrades performance.

Comments:	19 pages including appendix, 5 figures
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Information Theory (cs.IT); Applications (stat.AP); Machine Learning (stat.ML)
Cite as:	arXiv:2411.15931 [cs.LG]
	(or arXiv:2411.15931v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.15931

Submission history

From: Deep Chakraborty [view email]
[v1] Sun, 24 Nov 2024 17:38:23 UTC (1,059 KB)

Computer Science > Machine Learning

Title:Improving Pre-Trained Self-Supervised Embeddings Through Effective Entropy Maximization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Improving Pre-Trained Self-Supervised Embeddings Through Effective Entropy Maximization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators