Cooperative learning for multiview analysis

Ding, Daisy Yi; Li, Shuangning; Narasimhan, Balasubramanian; Tibshirani, Robert

doi:10.1073/pnas.2202113119

arXiv:2112.12337 (stat)

[Submitted on 23 Dec 2021 (v1), last revised 3 Sep 2022 (this version, v6)]

Abstract: We propose a new method for supervised learning with multiple sets of features ("views"). The multiview problem is especially important in biology and medicine, where "-omics" data such as genomics, proteomics, and radiomics are measured on a common set of samples. Cooperative learning combines the usual squared-error loss of predictions with an "agreement" penalty that encourages the predictions from different data views to agree. By varying the weight of the agreement penalty, we get a continuum of solutions that includes the well-known early and late fusion approaches. Cooperative learning chooses the degree of agreement (or fusion) adaptively, using a validation set or cross-validation to estimate test-set prediction error. One version of our fitting procedure is modular: one can choose different fitting mechanisms (e.g., lasso, random forests, boosting, neural networks) appropriate for different data views. In the setting of cooperative regularized linear regression, the method combines the lasso penalty with the agreement penalty, yielding feature sparsity. The method can be especially powerful when the different data views share some underlying relationship in their signals that can be exploited to boost the signals. We show that cooperative learning achieves higher predictive accuracy on simulated data and on a real multiomics example of labor onset prediction. By leveraging aligned signals and allowing flexible fitting mechanisms for different modalities, cooperative learning offers a powerful approach to multiomics data fusion.
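The abstract does not write out the objective, so the following is only a minimal sketch of the idea for two views, under stated assumptions. It assumes linear predictors X @ a and Z @ b and the objective 0.5 * ||y - X a - Z b||^2 + 0.5 * rho * ||X a - Z b||^2, where rho is the agreement weight. It uses plain least squares rather than the lasso-penalized fit the abstract describes, so it does not produce sparsity. The function names, the simulated data, and the rho grid are all hypothetical.

```python
import numpy as np

def fit_cooperative_ls(X, Z, y, rho):
    """Minimize 0.5*||y - Xa - Zb||^2 + 0.5*rho*||Xa - Zb||^2 (no lasso term).

    The agreement penalty is folded into an ordinary least-squares problem
    on augmented data: extra rows [-sqrt(rho) X, sqrt(rho) Z] with target 0.
    """
    n, px = X.shape
    root = np.sqrt(rho)
    X_aug = np.vstack([np.hstack([X, Z]),
                       np.hstack([-root * X, root * Z])])
    y_aug = np.concatenate([y, np.zeros(n)])
    theta, *_ = np.linalg.lstsq(X_aug, y_aug, rcond=None)
    return theta[:px], theta[px:]

def disagreement(X, Z, a, b):
    """Squared distance between the two views' predictions."""
    gap = X @ a - Z @ b
    return float(gap @ gap)

def select_rho(X_tr, Z_tr, y_tr, X_va, Z_va, y_va, grid):
    """Pick the agreement weight with the lowest validation squared error."""
    best_rho, best_err = None, np.inf
    for rho in grid:
        a, b = fit_cooperative_ls(X_tr, Z_tr, y_tr, rho)
        err = float(np.mean((y_va - X_va @ a - Z_va @ b) ** 2))
        if err < best_err:
            best_rho, best_err = rho, err
    return best_rho, best_err

# Simulated data (hypothetical): both views are noisy copies of a shared signal u.
rng = np.random.default_rng(0)
n, px, pz = 200, 5, 4
u = rng.normal(size=n)
X = u[:, None] + rng.normal(size=(n, px))
Z = u[:, None] + rng.normal(size=(n, pz))
y = u + 0.5 * rng.normal(size=n)

# Hold out the last 50 samples as a validation set.
grid = [0.0, 0.5, 1.0, 5.0]
best_rho, best_err = select_rho(X[:150], Z[:150], y[:150],
                                X[150:], Z[150:], y[150:], grid)
print("selected rho:", best_rho, "validation MSE:", best_err)
```

Under these assumptions, rho = 0 reduces to least squares on the concatenated features [X, Z], which is early fusion. At rho = 1 the cross terms between the two views cancel, so each view is fit separately to y / 2 and the combined prediction is the average of the per-view fits, a simple form of late fusion. Larger rho pushes the two views' predictions closer together.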

Subjects:	Methodology (stat.ME); Quantitative Methods (q-bio.QM); Machine Learning (stat.ML)
Cite as:	arXiv:2112.12337 [stat.ME]
	(or arXiv:2112.12337v6 [stat.ME] for this version)
	https://doi.org/10.48550/arXiv.2112.12337
Related DOI:	https://doi.org/10.1073/pnas.2202113119

Submission history

From: Daisy Yi Ding
[v1] Thu, 23 Dec 2021 03:13:25 UTC (401 KB)
[v2] Tue, 28 Dec 2021 08:25:56 UTC (725 KB)
[v3] Thu, 6 Jan 2022 05:46:08 UTC (725 KB)
[v4] Mon, 31 Jan 2022 21:46:04 UTC (6,927 KB)
[v5] Thu, 9 Jun 2022 00:11:54 UTC (26,749 KB)
[v6] Sat, 3 Sep 2022 05:55:28 UTC (26,736 KB)
