Non-asymptotic Analysis of Biased Adaptive Stochastic Approximation

Surendran, Sobihan; Godichon-Baggioni, Antoine; Fermanian, Adeline; Corff, Sylvain Le

Statistics > Machine Learning

arXiv:2402.02857 (stat)

[Submitted on 5 Feb 2024 (v1), last revised 14 Mar 2025 (this version, v2)]

Title:Non-asymptotic Analysis of Biased Adaptive Stochastic Approximation

Authors:Sobihan Surendran (LPSM (UMR\_8001)), Antoine Godichon-Baggioni (LPSM (UMR\_8001)), Adeline Fermanian, Sylvain Le Corff (LPSM (UMR\_8001))

View PDF HTML (experimental)

Abstract:Stochastic Gradient Descent (SGD) with adaptive steps is widely used to train deep neural networks and generative models. Most theoretical results assume that it is possible to obtain unbiased gradient estimators, which is not the case in several recent deep learning and reinforcement learning applications that use Monte Carlo methods. This paper provides a comprehensive non-asymptotic analysis of SGD with biased gradients and adaptive steps for non-convex smooth functions. Our study incorporates time-dependent bias and emphasizes the importance of controlling the bias of the gradient estimator. In particular, we establish that Adagrad, RMSProp, and AMSGRAD, an exponential moving average variant of Adam, with biased gradients, converge to critical points for smooth non-convex functions at a rate similar to existing results in the literature for the unbiased case. Finally, we provide experimental results using Variational Autoenconders (VAE) and applications to several learning frameworks that illustrate our convergence results and show how the effect of bias can be reduced by appropriate hyperparameter tuning.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2402.02857 [stat.ML]
	(or arXiv:2402.02857v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2402.02857

Submission history

From: Antoine Godichon-Baggioni [view email] [via CCSD proxy]
[v1] Mon, 5 Feb 2024 10:17:36 UTC (1,007 KB)
[v2] Fri, 14 Mar 2025 16:27:25 UTC (1,292 KB)

Statistics > Machine Learning

Title:Non-asymptotic Analysis of Biased Adaptive Stochastic Approximation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Non-asymptotic Analysis of Biased Adaptive Stochastic Approximation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators