Benign Overfitting in Classification: Provably Counter Label Noise with Larger Models

Wen, Kaiyue; Teng, Jiaye; Zhang, Jingzhao

Computer Science > Machine Learning

arXiv:2206.00501 (cs)

[Submitted on 1 Jun 2022 (v1), last revised 3 Apr 2023 (this version, v2)]

Title:Benign Overfitting in Classification: Provably Counter Label Noise with Larger Models

Authors:Kaiyue Wen, Jiaye Teng, Jingzhao Zhang

View PDF

Abstract:Studies on benign overfitting provide insights for the success of overparameterized deep learning models. In this work, we examine whether overfitting is truly benign in real-world classification tasks. We start with the observation that a ResNet model overfits benignly on Cifar10 but not benignly on ImageNet. To understand why benign overfitting fails in the ImageNet experiment, we theoretically analyze benign overfitting under a more restrictive setup where the number of parameters is not significantly larger than the number of data points. Under this mild overparameterization setup, our analysis identifies a phase change: unlike in the previous heavy overparameterization settings, benign overfitting can now fail in the presence of label noise. Our analysis explains our empirical observations, and is validated by a set of control experiments with ResNets. Our work highlights the importance of understanding implicit bias in underfitting regimes as a future direction.

Comments:	Published as a conference paper at ICLR 2023
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2206.00501 [cs.LG]
	(or arXiv:2206.00501v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.00501

Submission history

From: Jiaye Teng [view email]
[v1] Wed, 1 Jun 2022 14:00:37 UTC (374 KB)
[v2] Mon, 3 Apr 2023 13:32:08 UTC (236 KB)

Computer Science > Machine Learning

Title:Benign Overfitting in Classification: Provably Counter Label Noise with Larger Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Benign Overfitting in Classification: Provably Counter Label Noise with Larger Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators