The Outer Product Structure of Neural Network Derivatives

Bakker, Craig; Henry, Michael J.; Hodas, Nathan O.

arXiv:1810.03798 (cs)

[Submitted on 9 Oct 2018]

Abstract: In this paper, we show that feedforward and recurrent neural networks exhibit an outer product derivative structure but that convolutional neural networks do not. This structure makes it possible to use higher-order information without needing approximations or infeasibly large amounts of memory, and it may also provide insights into the geometry of neural network optima. The ability to easily access these derivatives also suggests a new, geometric approach to regularization. We then discuss how this structure could be used to improve training methods, increase network robustness and generalizability, and inform network compression methods.
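To make the phrase "outer product derivative structure" concrete, the following is a minimal sketch of the simplest first-order case only. It is not the paper's derivation, which also covers higher-order derivatives and recurrent networks. It assumes a single dense layer z = W x with squared-error loss against a target y; the function names, the toy values, and the choice of loss are all hypothetical. Under those assumptions, the gradient of the loss with respect to W for one example is the outer product of the output error (z - y) and the input x, which the sketch checks against central finite differences.

```python
# Illustrative sketch (assumed setup, not taken from the paper):
# one dense layer z = W x, loss = 0.5 * ||z - y||^2, single example.

def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in W]

def loss(W, x, y):
    """Squared-error loss for the assumed single-layer model."""
    z = matvec(W, x)
    return 0.5 * sum((z_i - y_i) ** 2 for z_i, y_i in zip(z, y))

def outer(u, v):
    """Outer product u v^T as a list of rows."""
    return [[u_i * v_j for v_j in v] for u_i in u]

def grad_outer_product(W, x, y):
    """dL/dW written as an outer product: (z - y) x^T."""
    z = matvec(W, x)
    delta = [z_i - y_i for z_i, y_i in zip(z, y)]
    return outer(delta, x)

def grad_finite_difference(W, x, y, eps=1e-5):
    """Entry-by-entry central-difference estimate of dL/dW."""
    grad = []
    for i in range(len(W)):
        row = []
        for j in range(len(W[0])):
            W_plus = [r[:] for r in W]
            W_minus = [r[:] for r in W]
            W_plus[i][j] += eps
            W_minus[i][j] -= eps
            row.append((loss(W_plus, x, y) - loss(W_minus, x, y)) / (2 * eps))
        grad.append(row)
    return grad

# Hypothetical toy values: 2 outputs, 3 inputs.
W = [[0.5, -1.0, 2.0], [1.5, 0.0, -0.5]]
x = [1.0, 2.0, 3.0]
y = [1.0, -1.0]

analytic = grad_outer_product(W, x, y)
numeric = grad_finite_difference(W, x, y)
max_abs_diff = max(
    abs(a - n) for ra, rn in zip(analytic, numeric) for a, n in zip(ra, rn)
)

print(analytic)  # [[3.5, 7.0, 10.5], [1.0, 2.0, 3.0]]
print(max_abs_diff)
```

In this toy case the full 2-by-3 gradient is determined by one length-2 vector and one length-3 vector, so it can be stored as m + n numbers rather than m * n. That is the kind of memory saving the abstract alludes to, shown here only for first derivatives of a single layer.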

Subjects: Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as: arXiv:1810.03798 [cs.LG] (or arXiv:1810.03798v1 [cs.LG] for this version)
DOI: https://doi.org/10.48550/arXiv.1810.03798

Submission history

From: Craig Bakker
[v1] Tue, 9 Oct 2018 03:37:08 UTC (19 KB)
