Ramanujan Bipartite Graph Products for Efficient Block Sparse Neural Networks

Vooturi, Dharma Teja; Varma, Girish; Kothapalli, Kishore

arXiv:2006.13486v1 (cs)

[Submitted on 24 Jun 2020 (this version), latest version 2 Jul 2020 (v2)]

Abstract: Sparse neural networks have been shown to give accurate predictions competitive with denser versions, while also minimizing the number of arithmetic operations performed. However, current hardware such as GPUs can only exploit structured sparsity patterns for better efficiency. Hence the run time of a sparse neural network may not correspond to the number of arithmetic operations required.
In this work, we propose the RBGP (Ramanujan Bipartite Graph Product) framework for generating structured multi-level block sparse neural networks by using the theory of graph products. We also propose to use products of Ramanujan graphs, which give the best connectivity for a given level of sparsity. This ensures that (i) the network has structured block sparsity, for which runtime-efficient algorithms exist; (ii) the model gives high prediction accuracy, due to the better expressive power derived from the connectivity of the graph; and (iii) the graph data structure has a succinct representation that can be stored efficiently in memory. We use our framework to design a specific connectivity pattern called RBGP4, which makes efficient use of the memory hierarchy available on GPUs. We benchmark our approach on an image classification task on the CIFAR dataset using VGG19 and WideResnet-40-4 networks, and achieve 5-9x and 2-5x runtime gains over unstructured and block sparsity patterns respectively, while achieving the same level of accuracy.
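To make the idea of a graph product yielding block sparsity concrete, here is a minimal sketch in Python. It is not the authors' implementation. The abstract does not say which graph product RBGP uses, so the sketch assumes the tensor (Kronecker) product of biadjacency matrices. The helper names `circulant_biadjacency`, `kron`, and `density` are hypothetical, and the small circulant base graphs are placeholders that have not been checked for the Ramanujan property.

```python
def circulant_biadjacency(n, offsets):
    """n x n 0/1 biadjacency matrix of a regular bipartite graph.

    Left node i is connected to right node (i + k) % n for each k in offsets.
    This is only a stand-in base graph (assumption), not a Ramanujan graph.
    """
    return [[1 if (j - i) % n in offsets else 0 for j in range(n)]
            for i in range(n)]


def kron(a, b):
    """Kronecker product of two matrices given as lists of lists."""
    rows_b, cols_b = len(b), len(b[0])
    return [
        [a[i // rows_b][j // cols_b] * b[i % rows_b][j % cols_b]
         for j in range(len(a[0]) * cols_b)]
        for i in range(len(a) * rows_b)
    ]


def density(m):
    """Fraction of nonzero entries in a 0/1 matrix."""
    return sum(map(sum, m)) / (len(m) * len(m[0]))


# Two small 2-regular bipartite graphs, each with density 0.5.
outer = circulant_biadjacency(4, {0, 1})
inner = circulant_biadjacency(4, {0, 1})

# Product mask: 16 x 16. Block (I, J) of size 4 x 4 is all zero when
# outer[I][J] == 0, and is a copy of `inner` otherwise.
mask = kron(outer, inner)

for row in mask:
    print("".join("#" if x else "." for x in row))
print("density:", density(mask))
```

Under these assumptions the product mask is described entirely by the two small factors, which illustrates the succinct-representation point: only `outer` and `inner` need to be stored, not the full mask. The densities multiply, so two factors of density 0.5 give a mask of density 0.25.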

Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (stat.ML)
Cite as:	arXiv:2006.13486 [cs.LG]
	(or arXiv:2006.13486v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2006.13486

Submission history

From: Dharma Teja Vooturi
[v1] Wed, 24 Jun 2020 05:08:17 UTC (573 KB)
[v2] Thu, 2 Jul 2020 12:22:52 UTC (574 KB)
