Generalization Error Bounds with Probabilistic Guarantee for SGD in Nonconvex Optimization

Zhou, Yi; Liang, Yingbin; Zhang, Huishuai


arXiv:1802.06903 (stat)

[Submitted on 19 Feb 2018 (v1), last revised 7 Mar 2019 (this version, v3)]



Abstract: The success of deep learning has led to rising interest in the generalization properties of the stochastic gradient descent (SGD) method, and stability is one popular approach to studying them. Existing stability-based works have studied nonconvex loss functions, but have only considered the generalization error of SGD in expectation. In this paper, we establish various generalization error bounds with probabilistic guarantees for SGD. Specifically, for both general nonconvex loss functions and gradient dominant loss functions, we characterize the on-average stability of the iterates generated by SGD in terms of the on-average variance of the stochastic gradients. This characterization leads to improved bounds on the generalization error of SGD. We then study the regularized risk minimization problem with strongly convex regularizers, and obtain improved generalization error bounds for proximal SGD. With strongly convex regularizers, we further establish generalization error bounds for nonconvex loss functions under proximal SGD with a high-probability guarantee, i.e., exponential concentration in probability.
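
For readers unfamiliar with proximal SGD, the following is a minimal illustrative sketch and not the authors' algorithm or experimental setup. It assumes a squared loss on a sigmoid output (a standard nonconvex example) and the strongly convex regularizer (lam/2)·||w||², whose proximal step has the closed form v / (1 + step·lam). All function names, the step size, and the value of lam are hypothetical choices made for illustration.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def sample_grad(w, x, y):
    """Gradient of the nonconvex per-sample loss (sigmoid(w.x) - y)^2."""
    s = sigmoid(sum(wi * xi for wi, xi in zip(w, x)))
    coeff = 2.0 * (s - y) * s * (1.0 - s)
    return [coeff * xi for xi in x]


def prox_l2(v, step, lam):
    """Proximal map of step * (lam / 2) * ||w||^2: shrink v toward zero."""
    return [vi / (1.0 + step * lam) for vi in v]


def proximal_sgd(data, dim, step=0.1, lam=0.01, iters=1000, seed=0):
    """Assumed setup: sample uniformly with replacement, constant step size."""
    rng = random.Random(seed)
    w = [0.0] * dim
    for _ in range(iters):
        x, y = data[rng.randrange(len(data))]
        g = sample_grad(w, x, y)
        # Gradient step on the loss, then proximal step on the regularizer.
        w = prox_l2([wi - step * gi for wi, gi in zip(w, g)], step, lam)
    return w


# Toy data (hypothetical): the label depends only on the sign of the first feature.
data = [([1.0, 0.0], 1.0), ([-1.0, 0.0], 0.0)]
w = proximal_sgd(data, dim=2)
```

Running `proximal_sgd` with the same seed on two datasets that differ in a single sample, and measuring the distance between the resulting iterates, gives an informal empirical view of the kind of stability the abstract refers to.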

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Optimization and Control (math.OC)
Cite as:	arXiv:1802.06903 [stat.ML]
	(or arXiv:1802.06903v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1802.06903

Submission history

From: Yi Zhou
[v1] Mon, 19 Feb 2018 23:04:20 UTC (64 KB)
[v2] Tue, 11 Dec 2018 19:13:12 UTC (64 KB)
[v3] Thu, 7 Mar 2019 03:30:18 UTC (108 KB)
