Tensorial Neural Networks: Generalization of Neural Networks and Application to Model Compression

Su, Jiahao; Li, Jingling; Bhattacharjee, Bobby; Huang, Furong

Statistics > Machine Learning

arXiv:1805.10352 (stat)

[Submitted on 25 May 2018 (v1), last revised 8 Dec 2018 (this version, v3)]

Title:Tensorial Neural Networks: Generalization of Neural Networks and Application to Model Compression

Authors:Jiahao Su, Jingling Li, Bobby Bhattacharjee, Furong Huang

View PDF

Abstract:We propose tensorial neural networks (TNNs), a generalization of existing neural networks by extending tensor operations on low order operands to those on high order ones. The problem of parameter learning is challenging, as it corresponds to hierarchical nonlinear tensor decomposition. We propose to solve the learning problem using stochastic gradient descent by deriving nontrivial backpropagation rules in generalized tensor algebra we introduce. Our proposed TNNs has three advantages over existing neural networks: (1) TNNs naturally apply to high order input object and thus preserve the multi-dimensional structure in the input, as there is no need to flatten the data. (2) TNNs interpret designs of existing neural network architectures. (3) Mapping a neural network to TNNs with the same expressive power results in a TNN of fewer parameters. TNN based compression of neural network improves existing low-rank approximation based compression methods as TNNs exploit two other types of invariant structures, periodicity and modulation, in addition to the low rankness. Experiments on LeNet-5 (MNIST), ResNet-32 (CIFAR10) and ResNet-50 (ImageNet) demonstrate that our TNN based compression outperforms (5% test accuracy improvement universally on CIFAR10) the state-of-the-art low-rank approximation based compression methods under the same compression rate, besides achieving orders of magnitude faster convergence rates due to the efficiency of TNNs.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1805.10352 [stat.ML]
	(or arXiv:1805.10352v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1805.10352

Submission history

From: Jiahao Su [view email]
[v1] Fri, 25 May 2018 20:21:50 UTC (109 KB)
[v2] Tue, 10 Jul 2018 19:03:18 UTC (126 KB)
[v3] Sat, 8 Dec 2018 23:16:03 UTC (270 KB)

Statistics > Machine Learning

Title:Tensorial Neural Networks: Generalization of Neural Networks and Application to Model Compression

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Tensorial Neural Networks: Generalization of Neural Networks and Application to Model Compression

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators