The "Law" of the Unconscious Contrastive Learner: Probabilistic Alignment of Unpaired Modalities

Che, Yongwei; Eysenbach, Benjamin

Computer Science > Machine Learning

arXiv:2501.11326 (cs)

[Submitted on 20 Jan 2025]

Title:The "Law" of the Unconscious Contrastive Learner: Probabilistic Alignment of Unpaired Modalities

Authors:Yongwei Che, Benjamin Eysenbach

View PDF HTML (experimental)

Abstract:While internet-scale data often comes in pairs (e.g., audio/image, image/text), we often want to perform inferences over modalities unseen together in the training data (e.g., audio/text). Empirically, this can often be addressed by learning multiple contrastive embedding spaces between existing modality pairs, implicitly hoping that unseen modality pairs will end up being aligned. This theoretical paper proves that this hope is well founded, under certain assumptions. Starting with the proper Bayesian approach of integrating out intermediate modalities, we show that directly comparing the representations of data from unpaired modalities can recover the same likelihood ratio. Our analysis builds on prior work on the geometry and probabilistic interpretation of contrastive representations, showing how these representations can answer many of the same inferences as probabilistic graphical models. Our analysis suggests two new ways of using contrastive representations: in settings with pre-trained contrastive models, and for handling language ambiguity in reinforcement learning. Our numerical experiments study the importance of our assumptions and demonstrate these new applications.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2501.11326 [cs.LG]
	(or arXiv:2501.11326v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2501.11326

Submission history

From: Yongwei Che [view email]
[v1] Mon, 20 Jan 2025 08:10:15 UTC (2,101 KB)

Computer Science > Machine Learning

Title:The "Law" of the Unconscious Contrastive Learner: Probabilistic Alignment of Unpaired Modalities

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The "Law" of the Unconscious Contrastive Learner: Probabilistic Alignment of Unpaired Modalities

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators