Noether's Learning Dynamics: Role of Symmetry Breaking in Neural Networks

Tanaka, Hidenori; Kunin, Daniel

Computer Science > Machine Learning

arXiv:2105.02716 (cs)

[Submitted on 6 May 2021 (v1), last revised 2 Nov 2021 (this version, v2)]

Title:Noether's Learning Dynamics: Role of Symmetry Breaking in Neural Networks

Authors:Hidenori Tanaka, Daniel Kunin

View PDF

Abstract:In nature, symmetry governs regularities, while symmetry breaking brings texture. In artificial neural networks, symmetry has been a central design principle to efficiently capture regularities in the world, but the role of symmetry breaking is not well understood. Here, we develop a theoretical framework to study the "geometry of learning dynamics" in neural networks, and reveal a key mechanism of explicit symmetry breaking behind the efficiency and stability of modern neural networks. To build this understanding, we model the discrete learning dynamics of gradient descent using a continuous-time Lagrangian formulation, in which the learning rule corresponds to the kinetic energy and the loss function corresponds to the potential energy. Then, we identify "kinetic symmetry breaking" (KSB), the condition when the kinetic energy explicitly breaks the symmetry of the potential function. We generalize Noether's theorem known in physics to take into account KSB and derive the resulting motion of the Noether charge: "Noether's Learning Dynamics" (NLD). Finally, we apply NLD to neural networks with normalization layers and reveal how KSB introduces a mechanism of "implicit adaptive optimization", establishing an analogy between learning dynamics induced by normalization layers and RMSProp. Overall, through the lens of Lagrangian mechanics, we have established a theoretical foundation to discover geometric design principles for the learning dynamics of neural networks.

Subjects:	Machine Learning (cs.LG); Disordered Systems and Neural Networks (cond-mat.dis-nn); Statistical Mechanics (cond-mat.stat-mech); Neurons and Cognition (q-bio.NC); Machine Learning (stat.ML)
Cite as:	arXiv:2105.02716 [cs.LG]
	(or arXiv:2105.02716v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2105.02716
Journal reference:	NeurIPS (Advances in Neural Information Processing Systems), 2021

Submission history

From: Hidenori Tanaka [view email]
[v1] Thu, 6 May 2021 14:36:10 UTC (20 KB)
[v2] Tue, 2 Nov 2021 11:38:59 UTC (321 KB)

Computer Science > Machine Learning

Title:Noether's Learning Dynamics: Role of Symmetry Breaking in Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Noether's Learning Dynamics: Role of Symmetry Breaking in Neural Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators