A Probabilistic Model for Self-Supervised Learning

Fleissner, Maximilian; Esser, Pascal; Ghoshdastidar, Debarghya

arXiv:2501.13031 (cs)

[Submitted on 22 Jan 2025]

Abstract: Self-supervised learning (SSL) aims to find meaningful representations from unlabeled data by encoding semantic similarities through data augmentations. Despite its current popularity, theoretical insights about SSL are still scarce. For example, it is not yet known whether commonly used SSL loss functions can be related to a statistical model, much in the same way as OLS, generalized linear models or PCA naturally emerge as maximum likelihood estimates of an underlying generative process. In this short paper, we consider a latent variable statistical model for SSL that exhibits an interesting property: depending on the informativeness of the data augmentations, the MLE of the model either reduces to PCA, or approaches a simple non-contrastive loss. We analyze the model and also empirically illustrate our findings.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2501.13031 [cs.LG]
	(or arXiv:2501.13031v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2501.13031

Submission history

From: Pascal Mattia Esser
[v1] Wed, 22 Jan 2025 17:25:47 UTC (186 KB)
