Joint Optimization of Offloading, Batching and DVFS for Multiuser Co-Inference

Xu, Yaodan; Zhou, Sheng; Niu, Zhisheng

Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:2504.14611 (cs)

[Submitted on 20 Apr 2025]

Title:Joint Optimization of Offloading, Batching and DVFS for Multiuser Co-Inference

Authors:Yaodan Xu, Sheng Zhou, Zhisheng Niu

View PDF HTML (experimental)

Abstract:With the growing integration of artificial intelligence in mobile applications, a substantial number of deep neural network (DNN) inference requests are generated daily by mobile devices. Serving these requests presents significant challenges due to limited device resources and strict latency requirements. Therefore, edge-device co-inference has emerged as an effective paradigm to address these issues. In this study, we focus on a scenario where multiple mobile devices offload inference tasks to an edge server equipped with a graphics processing unit (GPU). For finer control over offloading and scheduling, inference tasks are partitioned into smaller sub-tasks. Additionally, GPU batch processing is employed to boost throughput and improve energy efficiency. This work investigates the problem of minimizing total energy consumption while meeting hard latency constraints. We propose a low-complexity Joint DVFS, Offloading, and Batching strategy (J-DOB) to solve this problem. The effectiveness of the proposed algorithm is validated through extensive experiments across varying user numbers and deadline constraints. Results show that J-DOB can reduce energy consumption by up to 51.30% and 45.27% under identical and different deadlines, respectively, compared to local computing.

Comments:	Accepted by 2025 IEEE International Conference on Communications (ICC)
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as:	arXiv:2504.14611 [cs.DC]
	(or arXiv:2504.14611v1 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2504.14611

Submission history

From: Yaodan Xu [view email]
[v1] Sun, 20 Apr 2025 13:39:03 UTC (441 KB)

Computer Science > Distributed, Parallel, and Cluster Computing

Title:Joint Optimization of Offloading, Batching and DVFS for Multiuser Co-Inference

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Distributed, Parallel, and Cluster Computing

Title:Joint Optimization of Offloading, Batching and DVFS for Multiuser Co-Inference

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators