AMP4EC: Adaptive Model Partitioning Framework for Efficient Deep Learning Inference in Edge Computing Environments

Zhang, Guilin; Guo, Wulan; Tan, Ziqi; Jiang, Hailong

Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:2504.00407 (cs)

[Submitted on 1 Apr 2025 (v1), last revised 4 Apr 2025 (this version, v2)]

Title:AMP4EC: Adaptive Model Partitioning Framework for Efficient Deep Learning Inference in Edge Computing Environments

Authors:Guilin Zhang, Wulan Guo, Ziqi Tan, Hailong Jiang

View PDF HTML (experimental)

Abstract:Edge computing facilitates deep learning in resource-constrained environments, but challenges such as resource heterogeneity and dynamic constraints persist. This paper introduces AMP4EC, an Adaptive Model Partitioning framework designed to optimize deep learning inference in edge environments through real-time resource monitoring, dynamic model partitioning, and adaptive task scheduling. AMP4EC features a resource-aware model partitioner that splits deep learning models based on device capabilities, a task scheduler that ensures efficient load balancing using a weighted scoring mechanism, and a Docker-based deployment environment for validation. Experimental results show up to a 78% reduction in latency and a 414% improvement in throughput compared to baseline methods. The framework achieves consistent performance with low scheduling overhead across varying resource profiles, demonstrating adaptability in high-resource (1 CPU, 1GB RAM) and low-resource (0.4 CPU, 512MB RAM) scenarios. These results highlight AMP4EC's scalability, efficiency, and robustness for real-world edge deployments, addressing the critical need for efficient distributed inference in dynamic, resource-constrained environments.

Comments:	8 pages, accepted for oral presentation at FMEC 2025
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC)
ACM classes:	I.2.11; I.2.6; C.2.4
Cite as:	arXiv:2504.00407 [cs.DC]
	(or arXiv:2504.00407v2 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2504.00407

Submission history

From: Guilin Zhang [view email]
[v1] Tue, 1 Apr 2025 04:08:37 UTC (314 KB)
[v2] Fri, 4 Apr 2025 04:28:20 UTC (146 KB)

Computer Science > Distributed, Parallel, and Cluster Computing

Title:AMP4EC: Adaptive Model Partitioning Framework for Efficient Deep Learning Inference in Edge Computing Environments

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Distributed, Parallel, and Cluster Computing

Title:AMP4EC: Adaptive Model Partitioning Framework for Efficient Deep Learning Inference in Edge Computing Environments

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators