Convergence Rates of Stochastic Zeroth-order Gradient Descent for \L ojasiewicz Functions

Wang, Tianyu; Feng, Yasong

Mathematics > Optimization and Control

arXiv:2210.16997v3 (math)

[Submitted on 31 Oct 2022 (v1), revised 27 Feb 2023 (this version, v3), latest version 19 Apr 2023 (v6)]

Title:Convergence Rates of Stochastic Zeroth-order Gradient Descent for Łojasiewicz Functions

Authors:Tianyu Wang, Yasong Feng

View PDF

Abstract:We prove convergence rates of Stochastic Zeroth-order Gradient Descent (SZGD) algorithms for Lojasiewicz functions. The SZGD algorithm iterates as \begin{align*}
\mathbf{x}_{t+1} = \mathbf{x}_t - \eta_t \widehat{\nabla} f (\mathbf{x}_t), \qquad t = 0,1,2,3,\cdots , \end{align*} where $f$ is the objective function that satisfies the Łojasiewicz inequality with Łojasiewicz exponent $\theta$, $\eta_t$ is the step size (learning rate), and $ \widehat{\nabla} f (\mathbf{x}_t) $ is the approximate gradient estimated using zeroth-order information only.
Our results show that $ \{ f (\mathbf{x}_t) - f (\mathbf{x}_\infty) \}_{t \in \mathbb{N} } $ can converge faster than $ \{ \| \mathbf{x}_t - \mathbf{x}_\infty \| \}_{t \in \mathbb{N} }$, regardless of whether the objective $f$ is smooth or nonsmooth.

Comments:	more than major revision. Y. Feng is added to the author list
Subjects:	Optimization and Control (math.OC); Machine Learning (cs.LG)
Cite as:	arXiv:2210.16997 [math.OC]
	(or arXiv:2210.16997v3 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2210.16997

Submission history

From: Tianyu Wang [view email]
[v1] Mon, 31 Oct 2022 00:53:17 UTC (668 KB)
[v2] Mon, 14 Nov 2022 05:52:39 UTC (669 KB)
[v3] Mon, 27 Feb 2023 23:57:26 UTC (1,405 KB)
[v4] Thu, 9 Mar 2023 00:30:56 UTC (1,404 KB)
[v5] Mon, 20 Mar 2023 14:18:13 UTC (1,406 KB)
[v6] Wed, 19 Apr 2023 12:20:47 UTC (1,406 KB)

Mathematics > Optimization and Control

Title:Convergence Rates of Stochastic Zeroth-order Gradient Descent for Łojasiewicz Functions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Convergence Rates of Stochastic Zeroth-order Gradient Descent for Łojasiewicz Functions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators