Gradient Descent Robustly Learns the Intrinsic Dimension of Data in Training Convolutional Neural Networks

Zhang, Chenyang; Gao, Peifeng; Zou, Difan; Cao, Yuan

Statistics > Machine Learning

arXiv:2504.08628 (stat)

[Submitted on 11 Apr 2025]

Title:Gradient Descent Robustly Learns the Intrinsic Dimension of Data in Training Convolutional Neural Networks

Authors:Chenyang Zhang, Peifeng Gao, Difan Zou, Yuan Cao

View PDF HTML (experimental)

Abstract:Modern neural networks are usually highly over-parameterized. Behind the wide usage of over-parameterized networks is the belief that, if the data are simple, then the trained network will be automatically equivalent to a simple predictor. Following this intuition, many existing works have studied different notions of "ranks" of neural networks and their relation to the rank of data. In this work, we study the rank of convolutional neural networks (CNNs) trained by gradient descent, with a specific focus on the robustness of the rank to image background noises. Specifically, we point out that, when adding background noises to images, the rank of the CNN trained with gradient descent is affected far less compared with the rank of the data. We support our claim with a theoretical case study, where we consider a particular data model to characterize low-rank clean images with added background noises. We prove that CNNs trained by gradient descent can learn the intrinsic dimension of clean images, despite the presence of relatively large background noises. We also conduct experiments on synthetic and real datasets to further validate our claim.

Comments:	43 pages, 4 figures
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2504.08628 [stat.ML]
	(or arXiv:2504.08628v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2504.08628

Submission history

From: Yuan Cao [view email]
[v1] Fri, 11 Apr 2025 15:29:55 UTC (1,289 KB)

Statistics > Machine Learning

Title:Gradient Descent Robustly Learns the Intrinsic Dimension of Data in Training Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Gradient Descent Robustly Learns the Intrinsic Dimension of Data in Training Convolutional Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators