Federated unsupervised random forest for privacy-preserving patient stratification

Pfeifer, Bastian; Sirocchi, Christel; Bloice, Marcus D.; Kreuzthaler, Markus; Urschler, Martin

Computer Science > Machine Learning

arXiv:2401.16094 (cs)

[Submitted on 29 Jan 2024]

Title:Federated unsupervised random forest for privacy-preserving patient stratification

Authors:Bastian Pfeifer, Christel Sirocchi, Marcus D. Bloice, Markus Kreuzthaler, Martin Urschler

View PDF

Abstract:In the realm of precision medicine, effective patient stratification and disease subtyping demand innovative methodologies tailored for multi-omics data. Clustering techniques applied to multi-omics data have become instrumental in identifying distinct subgroups of patients, enabling a finer-grained understanding of disease variability. This work establishes a powerful framework for advancing precision medicine through unsupervised random-forest-based clustering and federated computing. We introduce a novel multi-omics clustering approach utilizing unsupervised random-forests. The unsupervised nature of the random forest enables the determination of cluster-specific feature importance, unraveling key molecular contributors to distinct patient groups. Moreover, our methodology is designed for federated execution, a crucial aspect in the medical domain where privacy concerns are paramount. We have validated our approach on machine learning benchmark data sets as well as on cancer data from The Cancer Genome Atlas (TCGA). Our method is competitive with the state-of-the-art in terms of disease subtyping, but at the same time substantially improves the cluster interpretability. Experiments indicate that local clustering performance can be improved through federated computing.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Quantitative Methods (q-bio.QM)
Cite as:	arXiv:2401.16094 [cs.LG]
	(or arXiv:2401.16094v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2401.16094

Submission history

From: Bastian Pfeifer [view email]
[v1] Mon, 29 Jan 2024 12:04:14 UTC (11,803 KB)

Computer Science > Machine Learning

Title:Federated unsupervised random forest for privacy-preserving patient stratification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Federated unsupervised random forest for privacy-preserving patient stratification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators