Aiming towards the minimizers: fast convergence of SGD for overparametrized problems

Liu, Chaoyue; Drusvyatskiy, Dmitriy; Belkin, Mikhail; Davis, Damek; Ma, Yi-An

arXiv:2306.02601 (cs)

[Submitted on 5 Jun 2023]

Abstract: Modern machine learning paradigms, such as deep learning, occur in or close to the interpolation regime, wherein the number of model parameters is much larger than the number of data samples. In this work, we propose a regularity condition within the interpolation regime which endows the stochastic gradient method with the same worst-case iteration complexity as the deterministic gradient method, while using only a single sampled gradient (or a minibatch) in each iteration. In contrast, all existing guarantees require the stochastic gradient method to take small steps, thereby resulting in a much slower linear rate of convergence. Finally, we demonstrate that our condition holds when training sufficiently wide feedforward neural networks with a linear output layer.
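
As a rough illustration of the interpolation regime described in the abstract, the sketch below runs single-sample SGD with a constant step size on a toy overparametrized least-squares problem (more parameters than samples, so a zero-loss solution exists). It is not the paper's regularity condition, step-size rule, or experimental setup: the problem sizes, the step size `1 / max_i ||a_i||^2`, and the names `make_problem`, `loss`, and `sgd` are all assumptions chosen for illustration.

```python
import random


def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(x * y for x, y in zip(u, v))


def make_problem(n=5, d=20, seed=0):
    """Random least-squares data with d > n, so some w fits every sample exactly."""
    rng = random.Random(seed)
    A = [[rng.gauss(0.0, 1.0) for _ in range(d)] for _ in range(n)]
    w_true = [rng.gauss(0.0, 1.0) for _ in range(d)]
    b = [dot(row, w_true) for row in A]  # labels are realizable: zero loss is attainable
    return A, b


def loss(A, b, w):
    """Average squared residual, f(w) = (1 / 2n) * sum_i (a_i . w - b_i)^2."""
    return sum((dot(row, w) - b_i) ** 2 for row, b_i in zip(A, b)) / (2 * len(A))


def sgd(A, b, steps=5000, seed=1):
    """Single-sample SGD with a constant step size (no decay schedule)."""
    rng = random.Random(seed)
    # Assumed step size: inverse of the largest per-sample smoothness constant.
    eta = 1.0 / max(dot(row, row) for row in A)
    w = [0.0] * len(A[0])
    for _ in range(steps):
        i = rng.randrange(len(A))        # sample one data point uniformly
        residual = dot(A[i], w) - b[i]   # gradient of sample i is residual * a_i
        w = [w_j - eta * residual * a_j for w_j, a_j in zip(w, A[i])]
    return w


if __name__ == "__main__":
    A, b = make_problem()
    w = sgd(A, b)
    print("initial loss:", loss(A, b, [0.0] * len(A[0])))
    print("final loss:  ", loss(A, b, w))
```

Because every sample can be fit exactly, the stochastic gradient vanishes at the minimizers, which is why a constant step size can drive the loss toward zero here instead of stalling at a noise floor.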

Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2306.02601 [cs.LG]
	(or arXiv:2306.02601v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.02601

Submission history

From: Chaoyue Liu
[v1] Mon, 5 Jun 2023 05:21:01 UTC (355 KB)
