Coded Computing for Low-Latency Federated Learning over Wireless Edge Networks

Prakash, Saurav; Dhakal, Sagar; Akdeniz, Mustafa; Yona, Yair; Talwar, Shilpa; Avestimehr, Salman; Himayat, Nageen

doi:10.1109/JSAC.2020.3036961

Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:2011.06223 (cs)

[Submitted on 12 Nov 2020 (v1), last revised 9 May 2021 (this version, v2)]

Title:Coded Computing for Low-Latency Federated Learning over Wireless Edge Networks

Authors:Saurav Prakash, Sagar Dhakal, Mustafa Akdeniz, Yair Yona, Shilpa Talwar, Salman Avestimehr, Nageen Himayat

View PDF

Abstract:Federated learning enables training a global model from data located at the client nodes, without data sharing and moving client data to a centralized server. Performance of federated learning in a multi-access edge computing (MEC) network suffers from slow convergence due to heterogeneity and stochastic fluctuations in compute power and communication link qualities across clients. We propose a novel coded computing framework, CodedFedL, that injects structured coding redundancy into federated learning for mitigating stragglers and speeding up the training procedure. CodedFedL enables coded computing for non-linear federated learning by efficiently exploiting distributed kernel embedding via random Fourier features that transforms the training task into computationally favourable distributed linear regression. Furthermore, clients generate local parity datasets by coding over their local datasets, while the server combines them to obtain the global parity dataset. Gradient from the global parity dataset compensates for straggling gradients during training, and thereby speeds up convergence. For minimizing the epoch deadline time at the MEC server, we provide a tractable approach for finding the amount of coding redundancy and the number of local data points that a client processes during training, by exploiting the statistical properties of compute as well as communication delays. We also characterize the leakage in data privacy when clients share their local parity datasets with the server. We analyze the convergence rate and iteration complexity of CodedFedL under simplifying assumptions, by treating CodedFedL as a stochastic gradient descent algorithm. Furthermore, we conduct numerical experiments using practical network parameters and benchmark datasets, where CodedFedL speeds up the overall training time by up to $15\times$ in comparison to the benchmark schemes.

Comments:	Final version to appear in the first issue of the IEEE JSAC Series on Machine Learning for Communications and Networks
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2011.06223 [cs.DC]
	(or arXiv:2011.06223v2 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2011.06223
Related DOI:	https://doi.org/10.1109/JSAC.2020.3036961

Submission history

From: Saurav Prakash [view email]
[v1] Thu, 12 Nov 2020 06:21:59 UTC (2,453 KB)
[v2] Sun, 9 May 2021 19:46:31 UTC (2,453 KB)

Computer Science > Distributed, Parallel, and Cluster Computing

Title:Coded Computing for Low-Latency Federated Learning over Wireless Edge Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Distributed, Parallel, and Cluster Computing

Title:Coded Computing for Low-Latency Federated Learning over Wireless Edge Networks

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators