Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs

Compagnoni, Enea Monzio; Islamov, Rustem; Proske, Frank Norbert; Lucchi, Aurelien

Computer Science > Machine Learning

arXiv:2502.17009 (cs)

[Submitted on 24 Feb 2025 (v1), last revised 28 Feb 2025 (this version, v2)]

Title:Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs

Authors:Enea Monzio Compagnoni, Rustem Islamov, Frank Norbert Proske, Aurelien Lucchi

View PDF HTML (experimental)

Abstract:Distributed methods are essential for handling machine learning pipelines comprising large-scale models and datasets. However, their benefits often come at the cost of increased communication overhead between the central server and agents, which can become the main bottleneck, making training costly or even unfeasible in such systems. Compression methods such as quantization and sparsification can alleviate this issue. Still, their robustness to large and heavy-tailed gradient noise, a phenomenon sometimes observed in language modeling, remains poorly understood. This work addresses this gap by analyzing Distributed Compressed SGD (DCSGD) and Distributed SignSGD (DSignSGD) using stochastic differential equations (SDEs). Our results show that DCSGD with unbiased compression is more vulnerable to noise in stochastic gradients, while DSignSGD remains robust, even under large and heavy-tailed noise. Additionally, we propose new scaling rules for hyperparameter tuning to mitigate performance degradation due to compression. These findings are empirically validated across multiple deep learning architectures and datasets, providing practical recommendations for distributed optimization.

Comments:	Accepted at AISTATS 2025 (Oral). arXiv admin note: substantial text overlap with arXiv:2411.15958
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2502.17009 [cs.LG]
	(or arXiv:2502.17009v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.17009

Submission history

From: Enea Monzio Compagnoni Mr. [view email]
[v1] Mon, 24 Feb 2025 09:39:17 UTC (2,730 KB)
[v2] Fri, 28 Feb 2025 00:12:11 UTC (2,729 KB)

Computer Science > Machine Learning

Title:Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators