Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization

Liu, Sijia; Kailkhura, Bhavya; Chen, Pin-Yu; Ting, Paishun; Chang, Shiyu; Amini, Lisa

Computer Science > Machine Learning

arXiv:1805.10367 (cs)

[Submitted on 25 May 2018 (v1), last revised 7 Jun 2018 (this version, v2)]

Title:Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization

Authors:Sijia Liu, Bhavya Kailkhura, Pin-Yu Chen, Paishun Ting, Shiyu Chang, Lisa Amini

View PDF

Abstract:As application demands for zeroth-order (gradient-free) optimization accelerate, the need for variance reduced and faster converging approaches is also intensifying. This paper addresses these challenges by presenting: a) a comprehensive theoretical analysis of variance reduced zeroth-order (ZO) optimization, b) a novel variance reduced ZO algorithm, called ZO-SVRG, and c) an experimental evaluation of our approach in the context of two compelling applications, black-box chemical material classification and generation of adversarial examples from black-box deep neural network models. Our theoretical analysis uncovers an essential difficulty in the analysis of ZO-SVRG: the unbiased assumption on gradient estimates no longer holds. We prove that compared to its first-order counterpart, ZO-SVRG with a two-point random gradient estimator could suffer an additional error of order $O(1/b)$, where $b$ is the mini-batch size. To mitigate this error, we propose two accelerated versions of ZO-SVRG utilizing variance reduced gradient estimators, which achieve the best rate known for ZO stochastic optimization (in terms of iterations). Our extensive experimental results show that our approaches outperform other state-of-the-art ZO algorithms, and strike a balance between the convergence rate and the function query complexity.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1805.10367 [cs.LG]
	(or arXiv:1805.10367v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.10367

Submission history

From: Sijia Liu [view email]
[v1] Fri, 25 May 2018 21:18:19 UTC (2,796 KB)
[v2] Thu, 7 Jun 2018 17:02:19 UTC (2,491 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2018-05

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Sijia Liu
Bhavya Kailkhura
Pin-Yu Chen
Pai-Shun Ting
Shiyu Chang

…

export BibTeX citation

Computer Science > Machine Learning

Title:Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators