Associative Convolutional Layers

Omidvar, Hamed; Akhlaghi, Vahideh; Franceschetti, Massimo; Gupta, Rajesh K.

Computer Science > Machine Learning

arXiv:1906.04309 (cs)

[Submitted on 10 Jun 2019 (v1), last revised 9 Aug 2019 (this version, v3)]

Title:Associative Convolutional Layers

Authors:Hamed Omidvar, Vahideh Akhlaghi, Massimo Franceschetti, Rajesh K. Gupta

View PDF

Abstract:Motivated by the necessity for parameter efficiency in distributed machine learning and AI-enabled edge devices, we provide a general and easy to implement method for significantly reducing the number of parameters of Convolutional Neural Networks (CNNs), during both the training and inference phases. We introduce a simple auxiliary neural network which can generate the convolutional filters of any CNN architecture from a low dimensional latent space. This auxiliary neural network, which we call "Convolutional Slice Generator" (CSG), is unique to the network and provides the association between its convolutional layers. During the training of the CNN, instead of training the filters of the convolutional layers, only the parameters of the CSG and their corresponding "code vectors" are trained. This results in a significant reduction of the number of parameters due to the fact that the CNN can be fully represented using only the parameters of the CSG, the code vectors, the fully connected layers, and the architecture of the CNN. We evaluate our approach by applying it to ResNet and DenseNet models when trained on CIFAR-10 and ImageNet datasets. While reducing the number of parameters by $\approx 2 \times$ on average, the accuracies of these networks remain within 1$\%$ of their original counterparts and in some cases there is an increase in the accuracy.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1906.04309 [cs.LG]
	(or arXiv:1906.04309v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1906.04309

Submission history

From: Hamed Omidvar [view email]
[v1] Mon, 10 Jun 2019 22:36:43 UTC (291 KB)
[v2] Wed, 12 Jun 2019 00:16:05 UTC (291 KB)
[v3] Fri, 9 Aug 2019 23:21:00 UTC (395 KB)

Computer Science > Machine Learning

Title:Associative Convolutional Layers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Associative Convolutional Layers

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators