Revisiting semi-supervised learning in the era of foundation models

Zhang, Ping; Mai, Zheda; Nguyen, Quang-Huy; Chao, Wei-Lun

Computer Science > Machine Learning

arXiv:2503.09707 (cs)

[Submitted on 12 Mar 2025]

Title:Revisiting semi-supervised learning in the era of foundation models

Authors:Ping Zhang, Zheda Mai, Quang-Huy Nguyen, Wei-Lun Chao

View PDF HTML (experimental)

Abstract:Semi-supervised learning (SSL) leverages abundant unlabeled data alongside limited labeled data to enhance learning. As vision foundation models (VFMs) increasingly serve as the backbone of vision applications, it remains unclear how SSL interacts with these pre-trained models. To address this gap, we develop new SSL benchmark datasets where frozen VFMs underperform and systematically evaluate representative SSL methods. We make a surprising observation: parameter-efficient fine-tuning (PEFT) using only labeled data often matches SSL performance, even without leveraging unlabeled data. This motivates us to revisit self-training, a conceptually simple SSL baseline, where we use the supervised PEFT model to pseudo-label unlabeled data for further training. To overcome the notorious issue of noisy pseudo-labels, we propose ensembling multiple PEFT approaches and VFM backbones to produce more robust pseudo-labels. Empirical results validate the effectiveness of this simple yet powerful approach, providing actionable insights into SSL with VFMs and paving the way for more scalable and practical semi-supervised learning in the era of foundation models.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2503.09707 [cs.LG]
	(or arXiv:2503.09707v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2503.09707

Submission history

From: Ping Zhang [view email]
[v1] Wed, 12 Mar 2025 18:01:10 UTC (162 KB)

Computer Science > Machine Learning

Title:Revisiting semi-supervised learning in the era of foundation models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Revisiting semi-supervised learning in the era of foundation models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators