A Simple Data Augmentation for Feature Distribution Skewed Federated Learning

Yan, Yunlu; Fu, Huazhu; Li, Yuexiang; Xie, Jinheng; Ma, Jun; Yang, Guang; Zhu, Lei

Computer Science > Machine Learning

arXiv:2306.09363 (cs)

[Submitted on 14 Jun 2023 (v1), last revised 6 Dec 2024 (this version, v2)]

Title:A Simple Data Augmentation for Feature Distribution Skewed Federated Learning

Authors:Yunlu Yan, Huazhu Fu, Yuexiang Li, Jinheng Xie, Jun Ma, Guang Yang, Lei Zhu

View PDF HTML (experimental)

Abstract:Federated Learning (FL) facilitates collaborative learning among multiple clients in a distributed manner and ensures the security of privacy. However, its performance inevitably degrades with non-Independent and Identically Distributed (non-IID) data. In this paper, we focus on the feature distribution skewed FL scenario, a common non-IID situation in real-world applications where data from different clients exhibit varying underlying distributions. This variation leads to feature shift, which is a key issue of this scenario. While previous works have made notable progress, few pay attention to the data itself, i.e., the root of this issue. The primary goal of this paper is to mitigate feature shift from the perspective of data. To this end, we propose a simple yet remarkably effective input-level data augmentation method, namely FedRDN, which randomly injects the statistical information of the local distribution from the entire federation into the client's data. This is beneficial to improve the generalization of local feature representations, thereby mitigating feature shift. Moreover, our FedRDN is a plug-and-play component, which can be seamlessly integrated into the data augmentation flow with only a few lines of code. Extensive experiments on several datasets show that the performance of various representative FL methods can be further improved by integrating our FedRDN, demonstrating its effectiveness, strong compatibility and generalizability. Code will be released.

Comments:	11 pages, 3 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2306.09363 [cs.LG]
	(or arXiv:2306.09363v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.09363

Submission history

From: Yunlu Yan [view email]
[v1] Wed, 14 Jun 2023 05:46:52 UTC (1,634 KB)
[v2] Fri, 6 Dec 2024 05:35:09 UTC (948 KB)

Computer Science > Machine Learning

Title:A Simple Data Augmentation for Feature Distribution Skewed Federated Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Simple Data Augmentation for Feature Distribution Skewed Federated Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators