Learning towards Minimum Hyperspherical Energy

Liu, Weiyang; Lin, Rongmei; Liu, Zhen; Liu, Lixin; Yu, Zhiding; Dai, Bo; Song, Le

Computer Science > Machine Learning

arXiv:1805.09298 (cs)

[Submitted on 23 May 2018 (v1), last revised 22 Jul 2020 (this version, v9)]

Title:Learning towards Minimum Hyperspherical Energy

Authors:Weiyang Liu, Rongmei Lin, Zhen Liu, Lixin Liu, Zhiding Yu, Bo Dai, Le Song

View PDF

Abstract:Neural networks are a powerful class of nonlinear functions that can be trained end-to-end on various applications. While the over-parametrization nature in many neural networks renders the ability to fit complex functions and the strong representation power to handle challenging tasks, it also leads to highly correlated neurons that can hurt the generalization ability and incur unnecessary computation cost. As a result, how to regularize the network to avoid undesired representation redundancy becomes an important issue. To this end, we draw inspiration from a well-known problem in physics -- Thomson problem, where one seeks to find a state that distributes N electrons on a unit sphere as evenly as possible with minimum potential energy. In light of this intuition, we reduce the redundancy regularization problem to generic energy minimization, and propose a minimum hyperspherical energy (MHE) objective as generic regularization for neural networks. We also propose a few novel variants of MHE, and provide some insights from a theoretical point of view. Finally, we apply neural networks with MHE regularization to several challenging tasks. Extensive experiments demonstrate the effectiveness of our intuition, by showing the superior performance with MHE regularization.

Comments:	NeurIPS 2018
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:1805.09298 [cs.LG]
	(or arXiv:1805.09298v9 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.09298

Submission history

From: Weiyang Liu [view email]
[v1] Wed, 23 May 2018 17:34:47 UTC (8,356 KB)
[v2] Tue, 5 Jun 2018 21:50:09 UTC (8,356 KB)
[v3] Wed, 13 Jun 2018 22:44:57 UTC (8,356 KB)
[v4] Sat, 16 Jun 2018 07:47:21 UTC (8,365 KB)
[v5] Sat, 27 Oct 2018 07:17:57 UTC (9,297 KB)
[v6] Sat, 1 Dec 2018 09:28:53 UTC (9,297 KB)
[v7] Wed, 9 Jan 2019 09:16:13 UTC (9,285 KB)
[v8] Tue, 5 Mar 2019 03:07:32 UTC (9,288 KB)
[v9] Wed, 22 Jul 2020 15:23:29 UTC (9,287 KB)

Computer Science > Machine Learning

Title:Learning towards Minimum Hyperspherical Energy

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning towards Minimum Hyperspherical Energy

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators