SplitEE: Early Exit in Deep Neural Networks with Split Computing

Bajpai, Divya J.; Trivedi, Vivek K.; Yadav, Sohan L.; Hanawal, Manjesh K.

Computer Science > Machine Learning

arXiv:2309.09195 (cs)

[Submitted on 17 Sep 2023]

Title:SplitEE: Early Exit in Deep Neural Networks with Split Computing

Authors:Divya J. Bajpai, Vivek K. Trivedi, Sohan L. Yadav, Manjesh K. Hanawal

View PDF

Abstract:Deep Neural Networks (DNNs) have drawn attention because of their outstanding performance on various tasks. However, deploying full-fledged DNNs in resource-constrained devices (edge, mobile, IoT) is difficult due to their large size. To overcome the issue, various approaches are considered, like offloading part of the computation to the cloud for final inference (split computing) or performing the inference at an intermediary layer without passing through all layers (early exits). In this work, we propose combining both approaches by using early exits in split computing. In our approach, we decide up to what depth of DNNs computation to perform on the device (splitting layer) and whether a sample can exit from this layer or need to be offloaded. The decisions are based on a weighted combination of accuracy, computational, and communication costs. We develop an algorithm named SplitEE to learn an optimal policy. Since pre-trained DNNs are often deployed in new domains where the ground truths may be unavailable and samples arrive in a streaming fashion, SplitEE works in an online and unsupervised setup. We extensively perform experiments on five different datasets. SplitEE achieves a significant cost reduction ($>50\%$) with a slight drop in accuracy ($<2\%$) as compared to the case when all samples are inferred at the final layer. The anonymized source code is available at \url{this https URL}.

Comments:	10 pages, to appear in the proceeding AIMLSystems 2023
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2309.09195 [cs.LG]
	(or arXiv:2309.09195v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2309.09195

Submission history

From: Manjesh Kumar Hanawal [view email]
[v1] Sun, 17 Sep 2023 07:48:22 UTC (952 KB)

Computer Science > Machine Learning

Title:SplitEE: Early Exit in Deep Neural Networks with Split Computing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:SplitEE: Early Exit in Deep Neural Networks with Split Computing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators