DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference

Zhang, Ziyang; Zhao, Yang; Li, Huan; Lin, Changyao; Liu, Jie

Computer Science > Machine Learning

arXiv:2306.01811 (cs)

[Submitted on 2 Jun 2023 (v1), last revised 23 Jun 2023 (this version, v3)]

Title:DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference

Authors:Ziyang Zhang, Yang Zhao, Huan Li, Changyao Lin, Jie Liu

View PDF

Abstract:Due to limited resources on edge and different characteristics of deep neural network (DNN) models, it is a big challenge to optimize DNN inference performance in terms of energy consumption and end-to-end latency on edge devices. In addition to the dynamic voltage frequency scaling (DVFS) technique, the edge-cloud architecture provides a collaborative approach for efficient DNN inference. However, current edge-cloud collaborative inference methods have not optimized various compute resources on edge devices. Thus, we propose DVFO, a novel DVFS-enabled edge-cloud collaborative inference framework, which co-optimizes DVFS and offloading parameters via deep reinforcement learning (DRL). Specifically, DVFO automatically co-optimizes 1) the CPU, GPU and memory frequencies of edge devices, and 2) the feature maps to be offloaded to cloud servers. In addition, it leverages a thinking-while-moving concurrent mechanism to accelerate the DRL learning process, and a spatial-channel attention mechanism to extract DNN feature maps of secondary importance for workload offloading. This approach improves inference performance for different DNN models under various edge-cloud network conditions. Extensive evaluations using two datasets and six widely-deployed DNN models on three heterogeneous edge devices show that DVFO significantly reduces the energy consumption by 33% on average, compared to state-of-the-art schemes. Moreover, DVFO achieves up to 28.6%-59.1% end-to-end latency reduction, while maintaining accuracy within 1% loss on average.

Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Operating Systems (cs.OS)
Cite as:	arXiv:2306.01811 [cs.LG]
	(or arXiv:2306.01811v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.01811

Submission history

From: Ziyang Zhang [view email]
[v1] Fri, 2 Jun 2023 07:00:42 UTC (1,727 KB)
[v2] Mon, 12 Jun 2023 07:11:05 UTC (3,343 KB)
[v3] Fri, 23 Jun 2023 07:34:40 UTC (4,051 KB)

Computer Science > Machine Learning

Title:DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators