On the Last-Iterate Convergence of Shuffling Gradient Methods

Liu, Zijian; Zhou, Zhengyuan

Computer Science > Machine Learning

arXiv:2403.07723v3 (cs)

[Submitted on 12 Mar 2024 (v1), last revised 6 Jun 2024 (this version, v3)]

Title:On the Last-Iterate Convergence of Shuffling Gradient Methods

Authors:Zijian Liu, Zhengyuan Zhou

View PDF HTML (experimental)

Abstract:Shuffling gradient methods are widely used in modern machine learning tasks and include three popular implementations: Random Reshuffle (RR), Shuffle Once (SO), and Incremental Gradient (IG). Compared to the empirical success, the theoretical guarantee of shuffling gradient methods was not well-understood for a long time. Until recently, the convergence rates had just been established for the average iterate for convex functions and the last iterate for strongly convex problems (using squared distance as the metric). However, when using the function value gap as the convergence criterion, existing theories cannot interpret the good performance of the last iterate in different settings (e.g., constrained optimization). To bridge this gap between practice and theory, we prove the first last-iterate convergence rates for shuffling gradient methods with respect to the objective value even without strong convexity. Our new results either (nearly) match the existing last-iterate lower bounds or are as fast as the previous best upper bounds for the average iterate.

Comments:	ICML 2024
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2403.07723 [cs.LG]
	(or arXiv:2403.07723v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.07723

Submission history

From: Zijian Liu [view email]
[v1] Tue, 12 Mar 2024 15:01:17 UTC (65 KB)
[v2] Thu, 30 May 2024 16:58:52 UTC (92 KB)
[v3] Thu, 6 Jun 2024 01:52:22 UTC (92 KB)

Computer Science > Machine Learning

Title:On the Last-Iterate Convergence of Shuffling Gradient Methods

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On the Last-Iterate Convergence of Shuffling Gradient Methods

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators