OCTOPUS: Overcoming Performance andPrivatization Bottlenecks in Distributed Learning

Wang, Shuo; Nepal, Surya; Moore, Kristen; Grobler, Marthie; Rudolph, Carsten; Abuadbba, Alsharif

doi:10.1109/TPDS.2022.3157258

Computer Science > Machine Learning

arXiv:2105.00602 (cs)

[Submitted on 3 May 2021 (v1), last revised 3 Mar 2022 (this version, v2)]

Title:OCTOPUS: Overcoming Performance andPrivatization Bottlenecks in Distributed Learning

Authors:Shuo Wang, Surya Nepal, Kristen Moore, Marthie Grobler, Carsten Rudolph, Alsharif Abuadbba

View PDF

Abstract:The diversity and quantity of data warehouses, gathering data from distributed devices such as mobile devices, can enhance the success and robustness of machine learning algorithms. Federated learning enables distributed participants to collaboratively learn a commonly-shared model while holding data locally. However, it is also faced with expensive communication and limitations due to the heterogeneity of distributed data sources and lack of access to global data. In this paper, we investigate a practical distributed learning scenario where multiple downstream tasks (e.g., classifiers) could be efficiently learned from dynamically-updated and non-iid distributed data sources while providing local data privatization. We introduce a new distributed/collaborative learning scheme to address communication overhead via latent compression, leveraging global data while providing privatization of local data without additional cost due to encryption or perturbation. This scheme divides learning into (1) informative feature encoding, and transmitting the latent representation of local data to address communication overhead; (2) downstream tasks centralized at the server using the encoded codes gathered from each node to address computing overhead. Besides, a disentanglement strategy is applied to address the privatization of sensitive components of local data. Extensive experiments are conducted on image and speech datasets. The results demonstrate that downstream tasks on the compact latent representations with the privatization of local data can achieve comparable accuracy to centralized learning.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2105.00602 [cs.LG]
	(or arXiv:2105.00602v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2105.00602
Related DOI:	https://doi.org/10.1109/TPDS.2022.3157258

Submission history

From: Shuo Wang [view email]
[v1] Mon, 3 May 2021 02:24:53 UTC (12,245 KB)
[v2] Thu, 3 Mar 2022 06:10:32 UTC (2,907 KB)

Computer Science > Machine Learning

Title:OCTOPUS: Overcoming Performance andPrivatization Bottlenecks in Distributed Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:OCTOPUS: Overcoming Performance andPrivatization Bottlenecks in Distributed Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators