The Separation Capacity of Random Neural Networks

Dirksen, Sjoerd; Genzel, Martin; Jacques, Laurent; Stollenwerk, Alexander

Computer Science > Machine Learning

arXiv:2108.00207 (cs)

[Submitted on 31 Jul 2021 (v1), last revised 28 Nov 2022 (this version, v2)]

Title:The Separation Capacity of Random Neural Networks

Authors:Sjoerd Dirksen, Martin Genzel, Laurent Jacques, Alexander Stollenwerk

View PDF

Abstract:Neural networks with random weights appear in a variety of machine learning applications, most prominently as the initialization of many deep learning algorithms and as a computationally cheap alternative to fully learned neural networks. In the present article, we enhance the theoretical understanding of random neural networks by addressing the following data separation problem: under what conditions can a random neural network make two classes $\mathcal{X}^-, \mathcal{X}^+ \subset \mathbb{R}^d$ (with positive distance) linearly separable? We show that a sufficiently large two-layer ReLU-network with standard Gaussian weights and uniformly distributed biases can solve this problem with high probability. Crucially, the number of required neurons is explicitly linked to geometric properties of the underlying sets $\mathcal{X}^-, \mathcal{X}^+$ and their mutual arrangement. This instance-specific viewpoint allows us to overcome the usual curse of dimensionality (exponential width of the layers) in non-pathological situations where the data carries low-complexity structure. We quantify the relevant structure of the data in terms of a novel notion of mutual complexity (based on a localized version of Gaussian mean width), which leads to sound and informative separation guarantees. We connect our result with related lines of work on approximation, memorization, and generalization.

Comments:	The current version of the manuscript has been accepted to Journal of Machine Learning Research
Subjects:	Machine Learning (cs.LG); Statistics Theory (math.ST)
Cite as:	arXiv:2108.00207 [cs.LG]
	(or arXiv:2108.00207v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2108.00207
Journal reference:	J. Mach. Learn. Res. 23:309 (2022) 1-47

Submission history

From: Martin Genzel [view email]
[v1] Sat, 31 Jul 2021 10:25:26 UTC (2,299 KB)
[v2] Mon, 28 Nov 2022 08:47:17 UTC (5,090 KB)

Computer Science > Machine Learning

Title:The Separation Capacity of Random Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The Separation Capacity of Random Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators