Generalization Analysis for Deep Contrastive Representation Learning

Hieu, Nong Minh; Ledent, Antoine; Lei, Yunwen; Ku, Cheng Yeaw

Statistics > Machine Learning

arXiv:2412.12014 (stat)

[Submitted on 16 Dec 2024 (v1), last revised 19 Dec 2024 (this version, v2)]

Title:Generalization Analysis for Deep Contrastive Representation Learning

Authors:Nong Minh Hieu, Antoine Ledent, Yunwen Lei, Cheng Yeaw Ku

View PDF

Abstract:In this paper, we present generalization bounds for the unsupervised risk in the Deep Contrastive Representation Learning framework, which employs deep neural networks as representation functions. We approach this problem from two angles. On the one hand, we derive a parameter-counting bound that scales with the overall size of the neural networks. On the other hand, we provide a norm-based bound that scales with the norms of neural networks' weight matrices. Ignoring logarithmic factors, the bounds are independent of $k$, the size of the tuples provided for contrastive learning. To the best of our knowledge, this property is only shared by one other work, which employed a different proof strategy and suffers from very strong exponential dependence on the depth of the network which is due to a use of the peeling technique. Our results circumvent this by leveraging powerful results on covering numbers with respect to uniform norms over samples. In addition, we utilize loss augmentation techniques to further reduce the dependency on matrix norms and the implicit dependence on network depth. In fact, our techniques allow us to produce many bounds for the contrastive learning setting with similar architectural dependencies as in the study of the sample complexity of ordinary loss functions, thereby bridging the gap between the learning theories of contrastive learning and DNNs.

Comments:	Accepted at AAAI 2025
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2412.12014 [stat.ML]
	(or arXiv:2412.12014v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2412.12014

Submission history

From: Minh Hieu Nong [view email]
[v1] Mon, 16 Dec 2024 17:40:05 UTC (298 KB)
[v2] Thu, 19 Dec 2024 06:21:35 UTC (299 KB)

Statistics > Machine Learning

Title:Generalization Analysis for Deep Contrastive Representation Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Generalization Analysis for Deep Contrastive Representation Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators