Understanding How Nonlinear Layers Create Linearly Separable Features for Low-Dimensional Data

Xu, Alec S.; Yaras, Can; Wang, Peng; Qu, Qing

Computer Science > Machine Learning

arXiv:2501.02364 (cs)

[Submitted on 4 Jan 2025]

Title:Understanding How Nonlinear Layers Create Linearly Separable Features for Low-Dimensional Data

Authors:Alec S. Xu, Can Yaras, Peng Wang, Qing Qu

View PDF HTML (experimental)

Abstract:Deep neural networks have attained remarkable success across diverse classification tasks. Recent empirical studies have shown that deep networks learn features that are linearly separable across classes. However, these findings often lack rigorous justifications, even under relatively simple settings. In this work, we address this gap by examining the linear separation capabilities of shallow nonlinear networks. Specifically, inspired by the low intrinsic dimensionality of image data, we model inputs as a union of low-dimensional subspaces (UoS) and demonstrate that a single nonlinear layer can transform such data into linearly separable sets. Theoretically, we show that this transformation occurs with high probability when using random weights and quadratic activations. Notably, we prove this can be achieved when the network width scales polynomially with the intrinsic dimension of the data rather than the ambient dimension. Experimental results corroborate these theoretical findings and demonstrate that similar linear separation properties hold in practical scenarios beyond our analytical scope. This work bridges the gap between empirical observations and theoretical understanding of the separation capacity of nonlinear networks, offering deeper insights into model interpretability and generalization.

Comments:	32 pages, 9 figures
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:2501.02364 [cs.LG]
	(or arXiv:2501.02364v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2501.02364

Submission history

From: Alec Xu [view email]
[v1] Sat, 4 Jan 2025 19:43:21 UTC (2,534 KB)

Computer Science > Machine Learning

Title:Understanding How Nonlinear Layers Create Linearly Separable Features for Low-Dimensional Data

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Understanding How Nonlinear Layers Create Linearly Separable Features for Low-Dimensional Data

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators