The Neural Covariance SDE: Shaped Infinite Depth-and-Width Networks at Initialization

Li, Mufan Bill; Nica, Mihai; Roy, Daniel M.

Statistics > Machine Learning

arXiv:2206.02768 (stat)

[Submitted on 6 Jun 2022 (v1), last revised 14 Jun 2023 (this version, v3)]

Title:The Neural Covariance SDE: Shaped Infinite Depth-and-Width Networks at Initialization

Authors:Mufan Bill Li, Mihai Nica, Daniel M. Roy

View PDF

Abstract:The logit outputs of a feedforward neural network at initialization are conditionally Gaussian, given a random covariance matrix defined by the penultimate layer. In this work, we study the distribution of this random matrix. Recent work has shown that shaping the activation function as network depth grows large is necessary for this covariance matrix to be non-degenerate. However, the current infinite-width-style understanding of this shaping method is unsatisfactory for large depth: infinite-width analyses ignore the microscopic fluctuations from layer to layer, but these fluctuations accumulate over many layers.
To overcome this shortcoming, we study the random covariance matrix in the shaped infinite-depth-and-width limit. We identify the precise scaling of the activation function necessary to arrive at a non-trivial limit, and show that the random covariance matrix is governed by a stochastic differential equation (SDE) that we call the Neural Covariance SDE. Using simulations, we show that the SDE closely matches the distribution of the random covariance matrix of finite networks. Additionally, we recover an if-and-only-if condition for exploding and vanishing norms of large shaped networks based on the activation function.

Comments:	48 pages, 10 figures. Advances in Neural Information Processing Systems (2022)
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2206.02768 [stat.ML]
	(or arXiv:2206.02768v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2206.02768

Submission history

From: Mufan (Bill) Li [view email]
[v1] Mon, 6 Jun 2022 17:45:07 UTC (222 KB)
[v2] Mon, 7 Nov 2022 19:05:09 UTC (218 KB)
[v3] Wed, 14 Jun 2023 19:07:07 UTC (218 KB)

Statistics > Machine Learning

Title:The Neural Covariance SDE: Shaped Infinite Depth-and-Width Networks at Initialization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:The Neural Covariance SDE: Shaped Infinite Depth-and-Width Networks at Initialization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators