Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need

Cabannes, Vivien; Bottou, Leon; Lecun, Yann; Balestriero, Randall

Computer Science > Machine Learning

arXiv:2303.15256 (cs)

[Submitted on 27 Mar 2023 (v1), last revised 29 Sep 2023 (this version, v2)]

Title:Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need

Authors:Vivien Cabannes, Leon Bottou, Yann Lecun, Randall Balestriero

View PDF

Abstract:Self-Supervised Learning (SSL) has emerged as the solution of choice to learn transferable representations from unlabeled data. However, SSL requires to build samples that are known to be semantically akin, i.e. positive views. Requiring such knowledge is the main limitation of SSL and is often tackled by ad-hoc strategies e.g. applying known data-augmentations to the same input. In this work, we formalize and generalize this principle through Positive Active Learning (PAL) where an oracle queries semantic relationships between samples. PAL achieves three main objectives. First, it unveils a theoretically grounded learning framework beyond SSL, based on similarity graphs, that can be extended to tackle supervised and semi-supervised learning depending on the employed oracle. Second, it provides a consistent algorithm to embed a priori knowledge, e.g. some observed labels, into any SSL losses without any change in the training pipeline. Third, it provides a proper active learning framework yielding low-cost solutions to annotate datasets, arguably bringing the gap between theory and practice of active learning that is based on simple-to-answer-by-non-experts queries of semantic relationships between inputs.

Comments:	8 main pages, 20 totals, 10 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
ACM classes:	I.2.6
Cite as:	arXiv:2303.15256 [cs.LG]
	(or arXiv:2303.15256v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2303.15256

Submission history

From: Vivien Cabannes [view email]
[v1] Mon, 27 Mar 2023 14:44:39 UTC (5,491 KB)
[v2] Fri, 29 Sep 2023 08:30:32 UTC (4,226 KB)

Computer Science > Machine Learning

Title:Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators