Quantifying Representation Reliability in Self-Supervised Learning Models

Park, Young-Jin; Wang, Hao; Ardeshir, Shervin; Azizan, Navid

arXiv:2306.00206 (cs)

[Submitted on 31 May 2023 (v1), last revised 17 May 2024 (this version, v2)]

Abstract: Self-supervised learning models extract general-purpose representations from data. Quantifying the reliability of these representations is crucial, as many downstream models rely on them as input for their own tasks. To this end, we introduce a formal definition of representation reliability: the representation for a given test point is considered to be reliable if the downstream models built on top of that representation can consistently generate accurate predictions for that test point. However, accessing downstream data to quantify the representation reliability is often infeasible or restricted due to privacy concerns. We propose an ensemble-based method for estimating the representation reliability without knowing the downstream tasks a priori. Our method is based on the concept of neighborhood consistency across distinct pre-trained representation spaces. The key insight is to find shared neighboring points as anchors to align these representation spaces before comparing them. We demonstrate through comprehensive numerical experiments that our method effectively captures the representation reliability with a high degree of correlation, achieving robust and favorable performance compared with baseline methods.
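
The neighborhood-consistency idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names (`knn_indices`, `neighborhood_consistency`), the choice of Euclidean distance, the use of Jaccard overlap between neighbor sets, and the toy data are all assumptions made for illustration. The sketch embeds the same reference points with several encoders, finds the test point's k nearest reference points in each space, and scores how much those neighbor sets agree. Because only the indices of shared reference points are compared, the spaces never need to share a coordinate system.

```python
from itertools import combinations
import math


def knn_indices(query, reference, k):
    """Return the set of indices of the k reference points closest to query."""
    dists = [math.dist(query, ref) for ref in reference]
    order = sorted(range(len(reference)), key=lambda i: dists[i])
    return set(order[:k])


def neighborhood_consistency(queries, references, k):
    """Average pairwise Jaccard overlap of k-NN sets across embedding spaces.

    queries[m]    : embedding of the test point under (hypothetical) encoder m
    references[m] : embeddings of the SAME reference points under encoder m,
                    listed in the same order for every encoder
    Requires at least two encoders.
    """
    neighbor_sets = [knn_indices(q, r, k) for q, r in zip(queries, references)]
    scores = [len(a & b) / len(a | b) for a, b in combinations(neighbor_sets, 2)]
    return sum(scores) / len(scores)


# Toy data: space_b is space_a rotated by 90 degrees, so raw coordinates
# differ even though the neighborhood structure is identical.
space_a = [(0.0, 0.0), (1.0, 0.0), (5.0, 5.0), (6.0, 5.0)]
space_b = [(0.0, 0.0), (0.0, 1.0), (-5.0, 5.0), (-5.0, 6.0)]

# Test point whose neighbors agree in both spaces.
consistent = neighborhood_consistency(
    [(0.1, 0.0), (0.0, 0.1)], [space_a, space_b], k=2
)

# Test point that lands near different reference points in each space.
inconsistent = neighborhood_consistency(
    [(0.1, 0.0), (-5.0, 5.5)], [space_a, space_b], k=2
)
```

In this toy setup a high score means the encoders agree on which reference points the test point resembles, and a low score means they disagree. Whether such a score tracks downstream accuracy is the empirical question the paper studies.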

Comments:	Presented in UAI 2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2306.00206 [cs.LG]
	(or arXiv:2306.00206v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.00206

Submission history

From: Young-Jin Park
[v1] Wed, 31 May 2023 21:57:33 UTC (3,898 KB)
[v2] Fri, 17 May 2024 18:48:24 UTC (3,335 KB)
