Mad Max: Affine Spline Insights into Deep Learning

Balestriero, Randall; Baraniuk, Richard

arXiv:1805.06576 (stat)

[Submitted on 17 May 2018 (v1), last revised 11 Nov 2018 (this version, v5)]

Abstract: We build a rigorous bridge between deep networks (DNs) and approximation theory via spline functions and operators. Our key result is that a large class of DNs can be written as a composition of max-affine spline operators (MASOs), which provide a powerful portal through which to view and analyze their inner workings. For instance, conditioned on the input signal, the output of a MASO DN can be written as a simple affine transformation of the input. This implies that a DN constructs a set of signal-dependent, class-specific templates against which the signal is compared via a simple inner product; we explore the links to the classical theory of optimal classification via matched filters and the effects of data memorization. Going further, we propose a simple penalty term that can be added to the cost function of any DN learning algorithm to force the templates to be orthogonal to each other; this leads to significantly improved classification performance and reduced overfitting with no change to the DN architecture. The spline partition of the input signal space that is implicitly induced by a MASO directly links DNs to the theory of vector quantization (VQ) and $K$-means clustering, which opens up a new geometric avenue to study how DNs organize signals in a hierarchical fashion. To validate the utility of the VQ interpretation, we develop and validate a new distance metric for signals and images that quantifies the difference between their VQ encodings. (This paper is a significantly expanded version of A Spline Theory of Deep Learning from ICML 2018.)
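
The claim that the output is affine in the input once the input is fixed can be checked numerically on a small ReLU network. The sketch below is illustrative only and is not the authors' code: the network sizes, random weights, and the function names relu_mlp_forward, input_conditioned_affine, and off_diagonal_penalty are assumptions made for this example. It folds the ReLU on/off pattern selected by an input x into a matrix A and offset b so that the network output equals A @ x + b on the region containing x. The rows of A play the role of the signal-dependent class templates, and the concatenated on/off pattern serves as a simple stand-in for the VQ code of x. The penalty shown is one plausible way to measure non-orthogonality of templates (sum of squared off-diagonal Gram entries); the abstract does not state the exact form used in the paper.

```python
import numpy as np

def relu_mlp_forward(x, weights, biases):
    """Plain forward pass: ReLU on hidden layers, linear output layer."""
    h = x
    for W, c in zip(weights[:-1], biases[:-1]):
        h = np.maximum(W @ h + c, 0.0)
    return weights[-1] @ h + biases[-1]

def input_conditioned_affine(x, weights, biases):
    """Return (A, b, code) with network(x) == A @ x + b on the region of x.

    code is the concatenated ReLU on/off pattern of the hidden layers,
    which identifies the partition region that x falls into.
    """
    A = np.eye(x.shape[0])
    b = np.zeros(x.shape[0])
    code = []
    for W, c in zip(weights[:-1], biases[:-1]):
        pre = W @ (A @ x + b) + c          # pre-activation at this layer
        q = (pre > 0).astype(float)        # which units are active for x
        A = q[:, None] * (W @ A)           # zero the rows of inactive units
        b = q * (W @ b + c)
        code.append(q.astype(int))
    A = weights[-1] @ A                    # final linear layer
    b = weights[-1] @ b + biases[-1]
    return A, b, np.concatenate(code)

def off_diagonal_penalty(A):
    """Assumed penalty: sum of squared inner products between distinct rows."""
    G = A @ A.T
    off = G - np.diag(np.diag(G))
    return float(np.sum(off ** 2))

# Hypothetical toy network: 4 inputs, two hidden layers of 8 units, 3 classes.
rng = np.random.default_rng(0)
sizes = [4, 8, 8, 3]
weights = [rng.standard_normal((m, n)) for n, m in zip(sizes[:-1], sizes[1:])]
biases = [rng.standard_normal(m) for m in sizes[1:]]

x = rng.standard_normal(4)
A, b, code = input_conditioned_affine(x, weights, biases)
print(np.allclose(A @ x + b, relu_mlp_forward(x, weights, biases)))
print(off_diagonal_penalty(A))
```

Each row of A is compared with x through an inner product, which is the matched-filter reading described in the abstract. Two inputs that produce the same code share the same A and b, so comparing codes gives a rough sense of how a VQ-based distance between signals could be built.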

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1805.06576 [stat.ML]
	(or arXiv:1805.06576v5 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1805.06576

Submission history

From: Randall Balestriero
[v1] Thu, 17 May 2018 02:04:54 UTC (7,458 KB)
[v2] Sat, 14 Jul 2018 09:45:33 UTC (8,114 KB)
[v3] Sat, 21 Jul 2018 11:32:05 UTC (8,217 KB)
[v4] Wed, 25 Jul 2018 19:34:14 UTC (8,273 KB)
[v5] Sun, 11 Nov 2018 23:01:58 UTC (7,030 KB)
