Stability and optimality in stochastic gradient descent

Toulis, Panos; Tran, Dustin; Airoldi, Edoardo M.

Statistics > Methodology

arXiv:1505.02417v1 (stat)

[Submitted on 10 May 2015 (this version), latest version 7 Jun 2016 (v4)]

Title:Stability and optimality in stochastic gradient descent

Authors:Panos Toulis, Dustin Tran, Edoardo M. Airoldi

View PDF

Abstract:Stochastic gradient methods have increasingly become popular for large-scale optimization. However, they are often numerically unstable because of their sensitivity to hyperparameters in the learning rate; furthermore they are statistically inefficient because of their suboptimal usage of the data's information. We propose a new learning procedure, termed averaged implicit stochastic gradient descent (ai-SGD), which combines stability through proximal (implicit) updates and statistical efficiency through averaging of the iterates.
In an asymptotic analysis we prove convergence of the procedure and show that it is statistically optimal, i.e., it achieves the Cramer-Rao lower variance bound. In a non-asymptotic analysis, we show that the stability of ai-SGD is due to its robustness to misspecifications of the learning rate with respect to the convexity of the loss function. Our experiments demonstrate that ai-SGD performs on par with state-of-the-art learning methods. Moreover, ai-SGD is more stable than averaging methods that do not utilize proximal updates, and it is simpler and computationally more efficient than methods that do employ proximal updates in an incremental fashion.

Subjects:	Methodology (stat.ME); Machine Learning (cs.LG); Computation (stat.CO); Machine Learning (stat.ML)
Cite as:	arXiv:1505.02417 [stat.ME]
	(or arXiv:1505.02417v1 [stat.ME] for this version)
	https://doi.org/10.48550/arXiv.1505.02417

Submission history

From: Dustin Tran [view email]
[v1] Sun, 10 May 2015 18:10:07 UTC (54 KB)
[v2] Tue, 20 Oct 2015 03:01:53 UTC (102 KB)
[v3] Fri, 3 Jun 2016 23:11:21 UTC (96 KB)
[v4] Tue, 7 Jun 2016 04:02:43 UTC (96 KB)

Statistics > Methodology

Title:Stability and optimality in stochastic gradient descent

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Methodology

Title:Stability and optimality in stochastic gradient descent

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators