Deep Neural Networks Learn Non-Smooth Functions Effectively

Imaizumi, Masaaki; Fukumizu, Kenji

Statistics > Machine Learning

arXiv:1802.04474 (stat)

[Submitted on 13 Feb 2018 (v1), last revised 7 Jul 2018 (this version, v2)]

Title:Deep Neural Networks Learn Non-Smooth Functions Effectively

Authors:Masaaki Imaizumi, Kenji Fukumizu

View PDF

Abstract:We theoretically discuss why deep neural networks (DNNs) performs better than other models in some cases by investigating statistical properties of DNNs for non-smooth functions. While DNNs have empirically shown higher performance than other standard methods, understanding its mechanism is still a challenging problem. From an aspect of the statistical theory, it is known many standard methods attain the optimal rate of generalization errors for smooth functions in large sample asymptotics, and thus it has not been straightforward to find theoretical advantages of DNNs. This paper fills this gap by considering learning of a certain class of non-smooth functions, which was not covered by the previous theory. We derive the generalization error of estimators by DNNs with a ReLU activation, and show that convergence rates of the generalization by DNNs are almost optimal to estimate the non-smooth functions, while some of the popular models do not attain the optimal rate. In addition, our theoretical result provides guidelines for selecting an appropriate number of layers and edges of DNNs. We provide numerical experiments to support the theoretical results.

Comments:	31 pages
Subjects:	Machine Learning (stat.ML)
Cite as:	arXiv:1802.04474 [stat.ML]
	(or arXiv:1802.04474v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1802.04474

Submission history

From: Masaaki Imaizumi [view email]
[v1] Tue, 13 Feb 2018 06:24:27 UTC (275 KB)
[v2] Sat, 7 Jul 2018 05:24:42 UTC (1,007 KB)

Statistics > Machine Learning

Title:Deep Neural Networks Learn Non-Smooth Functions Effectively

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Deep Neural Networks Learn Non-Smooth Functions Effectively

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators