Zeroth-order Stochastic Cubic Newton Method Revisited

Liu, Yu; Peng, Weibin; Wang, Tianyu; Yu, Jiajia

Mathematics > Optimization and Control

arXiv:2410.22357 (math)

[Submitted on 16 Oct 2024 (v1), last revised 8 Apr 2025 (this version, v2)]

Title:Zeroth-order Stochastic Cubic Newton Method Revisited

Authors:Yu Liu, Weibin Peng, Tianyu Wang, Jiajia Yu

View PDF HTML (experimental)

Abstract:This paper studies stochastic minimization of a finite-sum loss $ F (\mathbf{x}) = \frac{1}{N} \sum_{\xi=1}^N f(\mathbf{x};\xi) $. In many real-world scenarios, the Hessian matrix of such objectives exhibits a low-rank structure on a batch of data. At the same time, zeroth-order optimization has gained prominence in important applications such as fine-tuning large language models. Drawing on these observations, we propose a novel stochastic zeroth-order cubic Newton method that leverages the low-rank Hessian structure via a matrix recovery-based estimation technique. Our method circumvents restrictive incoherence assumptions, enabling accurate Hessian approximation through finite-difference queries. Theoretically, we establish that for most real-world problems in $\mathbb{R}^n$, $\mathcal{O}\left(\frac{n}{\eta^{\frac{7}{2}}}\right)+\widetilde{\mathcal{O}}\left(\frac{n^2 }{\eta^{\frac{5}{2}}}\right)$ function evaluations suffice to attain a second-order $\eta$-stationary point with high probability. This represents a significant improvement in dimensional dependence over existing methods. This improvement is mostly due to a new Hessian estimator that achieves superior sample complexity; This new Hessian estimation method might be of separate interest. Numerical experiments on matrix recovery and machine learning tasks validate the efficacy and scalability of our approach.

Comments:	arXiv admin note: text overlap with arXiv:2402.05385
Subjects:	Optimization and Control (math.OC)
Cite as:	arXiv:2410.22357 [math.OC]
	(or arXiv:2410.22357v2 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2410.22357

Submission history

From: Yu Liu [view email]
[v1] Wed, 16 Oct 2024 07:58:04 UTC (1,044 KB)
[v2] Tue, 8 Apr 2025 09:29:12 UTC (1,659 KB)

Mathematics > Optimization and Control

Title:Zeroth-order Stochastic Cubic Newton Method Revisited

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Zeroth-order Stochastic Cubic Newton Method Revisited

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators