Early Directional Convergence in Deep Homogeneous Neural Networks for Small Initializations

Kumar, Akshay; Haupt, Jarvis

Computer Science > Machine Learning

arXiv:2403.08121 (cs)

[Submitted on 12 Mar 2024 (v1), last revised 14 Mar 2025 (this version, v3)]

Title:Early Directional Convergence in Deep Homogeneous Neural Networks for Small Initializations

Authors:Akshay Kumar, Jarvis Haupt

View PDF HTML (experimental)

Abstract:This paper studies the gradient flow dynamics that arise when training deep homogeneous neural networks assumed to have locally Lipschitz gradients and an order of homogeneity strictly greater than two. It is shown here that for sufficiently small initializations, during the early stages of training, the weights of the neural network remain small in (Euclidean) norm and approximately converge in direction to the Karush-Kuhn-Tucker (KKT) points of the recently introduced neural correlation function. Additionally, this paper also studies the KKT points of the neural correlation function for feed-forward networks with (Leaky) ReLU and polynomial (Leaky) ReLU activations, deriving necessary and sufficient conditions for rank-one KKT points.

Comments:	tmlr-final-version
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2403.08121 [cs.LG]
	(or arXiv:2403.08121v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.08121

Submission history

From: Akshay Kumar [view email]
[v1] Tue, 12 Mar 2024 23:17:32 UTC (605 KB)
[v2] Sat, 7 Dec 2024 17:30:26 UTC (581 KB)
[v3] Fri, 14 Mar 2025 16:46:23 UTC (5,843 KB)

Full-text links:

Access Paper:

view license

Current browse context:

math

< prev | next >

new | recent | 2024-03

Change to browse by:

cs
cs.LG
math.OC
stat
stat.ML

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Early Directional Convergence in Deep Homogeneous Neural Networks for Small Initializations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Early Directional Convergence in Deep Homogeneous Neural Networks for Small Initializations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators