Your contrastive learning problem is secretly a distribution alignment problem

Chen, Zihao; Lin, Chi-Heng; Liu, Ran; Xiao, Jingyun; Dyer, Eva L

arXiv:2502.20141 (cs)

[Submitted on 27 Feb 2025]

Abstract: Despite the success of contrastive learning (CL) in vision and language, its theoretical foundations and mechanisms for building representations remain poorly understood. In this work, we build connections between noise contrastive estimation losses widely used in CL and distribution alignment with entropic optimal transport (OT). This connection allows us to develop a family of different losses and multistep iterative variants for existing CL methods. Intuitively, by using more information from the distribution of latents, our approach allows a more distribution-aware manipulation of the relationships within augmented sample sets. We provide theoretical insights and experimental evidence demonstrating the benefits of our approach for generalized contrastive alignment. Through this framework, it is possible to leverage tools in OT to build unbalanced losses to handle noisy views and customize the representation space by changing the constraints on alignment. By reframing contrastive learning as an alignment problem and leveraging existing optimization tools for OT, our work provides new insights and connections between different self-supervised learning models in addition to new tools that can be more easily adapted to incorporate domain knowledge into learning.
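The connection the abstract describes can be illustrated with a small numerical sketch. It is not the authors' implementation: the function names, the cosine-distance cost, the uniform marginals, and the one-directional form of InfoNCE are all assumptions made for illustration. Under those assumptions, rescaling the rows of a Gibbs kernel once reproduces the InfoNCE loss, while repeating row and column rescalings (Sinkhorn iterations) gives one possible reading of a "multistep iterative variant".

```python
import numpy as np


def _unit_rows(z):
    """Scale each row to unit Euclidean norm."""
    return z / np.linalg.norm(z, axis=1, keepdims=True)


def gibbs_kernel(z_a, z_b, eps):
    """Gibbs kernel exp(-C / eps), with C the cosine distance between two views (assumed cost)."""
    cost = 1.0 - _unit_rows(z_a) @ _unit_rows(z_b).T
    return np.exp(-cost / eps)


def row_projection(kernel):
    """Single scaling step: rescale rows so each sums to 1/n; columns are unconstrained."""
    n = kernel.shape[0]
    return kernel / (n * kernel.sum(axis=1, keepdims=True))


def sinkhorn_plan(kernel, n_iters):
    """Alternate row and column scalings toward uniform marginals (entropic OT)."""
    n, m = kernel.shape
    mu, nu = np.full(n, 1.0 / n), np.full(m, 1.0 / m)
    u, v = np.ones(n), np.ones(m)
    for _ in range(n_iters):
        u = mu / (kernel @ v)      # match row marginals
        v = nu / (kernel.T @ u)    # match column marginals
    return u[:, None] * kernel * v[None, :]


def alignment_loss(plan):
    """Negative mean log of n * P_ii: cross-entropy to the target diag(1/n), shifted by log n."""
    n = plan.shape[0]
    return -np.mean(np.log(n * np.diag(plan)))


def info_nce(z_a, z_b, temperature):
    """One-directional InfoNCE: z_b[i] is the positive for z_a[i], other rows of z_b are negatives."""
    logits = _unit_rows(z_a) @ _unit_rows(z_b).T / temperature
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_softmax = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_softmax))
```

With `eps` equal to the temperature, `alignment_loss(row_projection(K))` matches `info_nce` because the row normalization is exactly a softmax over each row of similarities. Passing `sinkhorn_plan(K, n_iters)` to `alignment_loss` instead also enforces the column marginal, which is one way the alignment constraints could be changed.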

Comments:	10 pages, 5 figures, NeurIPS 2024 submission, includes supplementary material
Subjects:	Machine Learning (cs.LG)
MSC classes:	68T07
ACM classes:	I.2.6
Cite as:	arXiv:2502.20141 [cs.LG]
	(or arXiv:2502.20141v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.20141
Journal reference:	Advances in Neural Information Processing Systems 37 (2025): 91597-91617

Submission history

From: Zihao Chen
[v1] Thu, 27 Feb 2025 14:33:08 UTC (5,347 KB)
