Random matrix analysis of deep neural network weight matrices

Thamm, Matthias; Staats, Max; Rosenow, Bernd

doi:10.1103/PhysRevE.106.054124

Condensed Matter > Disordered Systems and Neural Networks

arXiv:2203.14661 (cond-mat)

[Submitted on 28 Mar 2022 (v1), last revised 15 Nov 2022 (this version, v2)]

Title:Random matrix analysis of deep neural network weight matrices

Authors:Matthias Thamm, Max Staats, Bernd Rosenow

View PDF

Abstract:Neural networks have been used successfully in a variety of fields, which has led to a great deal of interest in developing a theoretical understanding of how they store the information needed to perform a particular task. We study the weight matrices of trained deep neural networks using methods from random matrix theory (RMT) and show that the statistics of most of the singular values follow universal RMT predictions. This suggests that they are random and do not contain system specific information, which we investigate further by comparing the statistics of eigenvector entries to the universal Porter-Thomas distribution. We find that for most eigenvectors the hypothesis of randomness cannot be rejected, and that only eigenvectors belonging to the largest singular values deviate from the RMT prediction, indicating that they may encode learned information. In addition, a comparison with RMT predictions also allows to distinguish networks trained in different learning regimes - from lazy to rich learning. We analyze the spectral distribution of the large singular values using the Hill estimator and find that the distribution cannot in general be characterized by a tail index, i.e. is not of power law type.

Comments:	16 pages, 14 figures, updated version
Subjects:	Disordered Systems and Neural Networks (cond-mat.dis-nn); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2203.14661 [cond-mat.dis-nn]
	(or arXiv:2203.14661v2 [cond-mat.dis-nn] for this version)
	https://doi.org/10.48550/arXiv.2203.14661
Journal reference:	Physical Review E 106, 054124 (2022)
Related DOI:	https://doi.org/10.1103/PhysRevE.106.054124

Submission history

From: Matthias Thamm [view email]
[v1] Mon, 28 Mar 2022 11:22:12 UTC (1,235 KB)
[v2] Tue, 15 Nov 2022 11:16:15 UTC (766 KB)

Condensed Matter > Disordered Systems and Neural Networks

Title:Random matrix analysis of deep neural network weight matrices

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Condensed Matter > Disordered Systems and Neural Networks

Title:Random matrix analysis of deep neural network weight matrices

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators