On Separation Between Best-Iterate, Random-Iterate, and Last-Iterate Convergence of Learning in Games

Cai, Yang; Farina, Gabriele; Grand-Clément, Julien; Kroer, Christian; Lee, Chung-Wei; Luo, Haipeng; Zheng, Weiqiang

Computer Science > Machine Learning

arXiv:2503.02825 (cs)

[Submitted on 4 Mar 2025]

Title:On Separation Between Best-Iterate, Random-Iterate, and Last-Iterate Convergence of Learning in Games

Authors:Yang Cai, Gabriele Farina, Julien Grand-Clément, Christian Kroer, Chung-Wei Lee, Haipeng Luo, Weiqiang Zheng

View PDF HTML (experimental)

Abstract:Non-ergodic convergence of learning dynamics in games is widely studied recently because of its importance in both theory and practice. Recent work (Cai et al., 2024) showed that a broad class of learning dynamics, including Optimistic Multiplicative Weights Update (OMWU), can exhibit arbitrarily slow last-iterate convergence even in simple $2 \times 2$ matrix games, despite many of these dynamics being known to converge asymptotically in the last iterate. It remains unclear, however, whether these algorithms achieve fast non-ergodic convergence under weaker criteria, such as best-iterate convergence. We show that for $2\times 2$ matrix games, OMWU achieves an $O(T^{-1/6})$ best-iterate convergence rate, in stark contrast to its slow last-iterate convergence in the same class of games. Furthermore, we establish a lower bound showing that OMWU does not achieve any polynomial random-iterate convergence rate, measured by the expected duality gaps across all iterates. This result challenges the conventional wisdom that random-iterate convergence is essentially equivalent to best-iterate convergence, with the former often used as a proxy for establishing the latter. Our analysis uncovers a new connection to dynamic regret and presents a novel two-phase approach to best-iterate convergence, which could be of independent interest.

Comments:	33 pages
Subjects:	Machine Learning (cs.LG); Computer Science and Game Theory (cs.GT); Optimization and Control (math.OC)
Cite as:	arXiv:2503.02825 [cs.LG]
	(or arXiv:2503.02825v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2503.02825

Submission history

From: Weiqiang Zheng [view email]
[v1] Tue, 4 Mar 2025 17:49:24 UTC (477 KB)

Computer Science > Machine Learning

Title:On Separation Between Best-Iterate, Random-Iterate, and Last-Iterate Convergence of Learning in Games

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:On Separation Between Best-Iterate, Random-Iterate, and Last-Iterate Convergence of Learning in Games

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators