On the Convergence of SGD Training of Neural Networks

Breuel, Thomas M.

arXiv:1508.02790 (cs)

[Submitted on 12 Aug 2015]

Abstract: Neural networks are usually trained by some form of stochastic gradient descent (SGD). A number of strategies intended to improve SGD optimization are in common use, such as learning rate schedules, momentum, and batching. These are motivated by ideas about the occurrence of local minima at different scales, valleys, and other phenomena in the objective function. Empirical results presented here suggest that these phenomena are not significant factors in SGD optimization of MLP-related objective functions, and that the behavior of stochastic gradient descent in these problems is better described as the simultaneous convergence at different rates of many, largely non-interacting subproblems.
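
The phrase "simultaneous convergence at different rates of many, largely non-interacting subproblems" can be illustrated with a toy sketch. The code below is not taken from the paper and does not reproduce its experiments. It assumes a hypothetical separable quadratic objective f(w) = 0.5 * sum_i c_i * w_i^2, in which each coordinate is an independent subproblem whose convergence speed depends only on its own curvature c_i. The function names, the curvature values, and the tolerance are all illustrative assumptions.

```python
import random


def sgd_separable_quadratic(curvatures, lr=0.1, momentum=0.0, steps=200,
                            noise=0.0, seed=0):
    """Run SGD (optionally with momentum) on f(w) = 0.5 * sum_i c_i * w_i**2.

    Toy objective chosen for illustration only (an assumption, not the
    paper's setup). Returns a list with one row per step, where each row
    holds |w_i| for every coordinate.
    """
    rng = random.Random(seed)
    w = [1.0 for _ in curvatures]   # every coordinate starts at 1.0
    v = [0.0 for _ in curvatures]   # momentum buffer
    history = [[abs(x) for x in w]]
    for _ in range(steps):
        for i, c in enumerate(curvatures):
            # Gradient of the i-th term plus optional Gaussian noise that
            # stands in for minibatch sampling noise.
            grad = c * w[i] + noise * rng.gauss(0.0, 1.0)
            v[i] = momentum * v[i] - lr * grad
            w[i] += v[i]
        history.append([abs(x) for x in w])
    return history


def steps_to_reach(history, coord, tol=1e-3):
    """Return the first step at which |w_coord| < tol, or None if never."""
    for t, row in enumerate(history):
        if row[coord] < tol:
            return t
    return None


if __name__ == "__main__":
    curvatures = [1.0, 0.1, 0.01]   # hypothetical per-subproblem curvatures
    hist = sgd_separable_quadratic(curvatures, lr=0.5, steps=2000)
    for i, c in enumerate(curvatures):
        print(f"curvature {c}: reached tolerance at step "
              f"{steps_to_reach(hist, i)}")
```

In this sketch, coordinates with smaller curvature reach the tolerance later, and changing one coordinate's curvature leaves the trajectories of the others unchanged. That is the sense in which the subproblems are non-interacting here. Real MLP objectives are not separable in this way, so the sketch only conveys the intuition behind the abstract's description.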

Subjects:	Neural and Evolutionary Computing (cs.NE); Machine Learning (cs.LG)
ACM classes:	K.3.2
Cite as:	arXiv:1508.02790 [cs.NE]
	(or arXiv:1508.02790v1 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.1508.02790

Submission history

From: Thomas M. Breuel
[v1] Wed, 12 Aug 2015 01:11:47 UTC (2,191 KB)
