Zeroth-order Stochastic Cubic Newton Method with Low-rank Hessian Estimation

Liu, Yu; Peng, Weibin; Wang, Tianyu; Yu, Jiajia

Mathematics > Optimization and Control

arXiv:2410.22357v1 (math)

[Submitted on 16 Oct 2024 (this version), latest version 8 Apr 2025 (v2)]

Title:Zeroth-order Stochastic Cubic Newton Method with Low-rank Hessian Estimation

Authors:Yu Liu, Weibin Peng, Tianyu Wang, Jiajia Yu

View PDF HTML (experimental)

Abstract:This paper focuses on the problem of minimizing a finite-sum loss $ \frac{1}{N}$ $ \sum_{\xi=1}^N f (\mathbf{x}; \xi) $, where only function evaluations of $ f (\cdot; \xi) $ is allowed. For a fixed $ \xi $, which represents a (batch of) training data, the Hessian matrix $ \nabla^2 f (\mathbf{x}; \xi) $ is usually low-rank. We develop a stochastic zeroth-order cubic Newton method for such problems, and prove its efficiency. More specifically, we show that when $ \nabla^2 f (\mathbf{x}; \xi) \in \mathbb{R}^{n\times n } $ is of rank-$r$, $ \mathcal{O}\left(\frac{n}{\eta^{\frac{7}{2}}}\right)+\widetilde{\mathcal{O}}\left(\frac{n^2 r^2 }{\eta^{\frac{5}{2}}}\right) $ function evaluations guarantee a second order $\eta$-stationary point with high probability. This result improves the dependence on dimensionality compared to the existing state-of-the-art. This improvement is achieved via a new Hessian estimation method, which can be efficiently computed by finite-difference operations, and does not require any incoherence assumptions. Numerical experiments are provided to demonstrate the effectiveness of our algorithm.

Comments:	arXiv admin note: text overlap with arXiv:2402.05385
Subjects:	Optimization and Control (math.OC)
Cite as:	arXiv:2410.22357 [math.OC]
	(or arXiv:2410.22357v1 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2410.22357

Submission history

From: Yu Liu [view email]
[v1] Wed, 16 Oct 2024 07:58:04 UTC (1,044 KB)
[v2] Tue, 8 Apr 2025 09:29:12 UTC (1,659 KB)

Mathematics > Optimization and Control

Title:Zeroth-order Stochastic Cubic Newton Method with Low-rank Hessian Estimation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Zeroth-order Stochastic Cubic Newton Method with Low-rank Hessian Estimation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators