Correlated Components Analysis --- Extracting Reliable Dimensions in Multivariate Data

Parra, Lucas C.; Haufe, Stefan; Dmochowski, Jacek P.

arXiv:1801.08881v2 (stat)

[Submitted on 26 Jan 2018 (v1), revised 10 Feb 2018 (this version, v2), latest version 20 Jan 2019 (v5)]

Abstract: How does one find data dimensions that are reliably expressed across repetitions? For example, in neuroscience one may want to identify combinations of brain signals that are reliably activated across multiple trials or subjects. For a clinical assessment with multiple ratings, one may want to identify an aggregate score that is reliably reproduced across raters. The approach proposed here --- "correlated components analysis" --- is to identify components that maximally correlate between repetitions (e.g. trials, subjects, raters). This can be expressed as the maximization of the ratio of between-repetition to within-repetition covariance, resulting in a generalized eigenvalue problem. We show that covariances can be computed efficiently without explicitly considering all pairs of repetitions, that the result is equivalent to multi-class linear discriminant analysis for unbiased signals, and that the approach also maximizes reliability, defined as the mean divided by the deviation across repetitions. We also extend the method to non-linear components using kernels, discuss regularization to improve numerical stability, present parametric and non-parametric tests to establish statistical significance, and provide code.
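
The generalized eigenvalue formulation described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' released code. The function name `corrca`, the (repetitions, dimensions, samples) array layout, the synthetic test data, and the scaling of the eigenvalues by N - 1 to read them as an average pairwise correlation are all assumptions made for illustration. The sketch obtains the between-repetition covariance from the covariance of the summed data, so no explicit loop over pairs of repetitions is needed.

```python
import numpy as np

def corrca(X):
    """Hypothetical helper: X has shape (N, D, T) = (repetitions, dimensions, samples)."""
    N, D, T = X.shape
    Xc = X - X.mean(axis=2, keepdims=True)        # center each repetition over samples
    # Within-repetition covariance: sum of the N per-repetition covariances.
    Rw = np.einsum("ldt,let->de", Xc, Xc) / T
    # Covariance of the data summed over repetitions = sum over ALL pairs (l, k).
    S = Xc.sum(axis=0)
    Rt = S @ S.T / T
    # Between-repetition covariance: the pairs with l != k only.
    Rb = Rt - Rw
    # Solve Rb v = lam Rw v by whitening with the Cholesky factor of Rw
    # (assumes Rw is positive definite; otherwise regularize first).
    L = np.linalg.cholesky(Rw)
    Linv = np.linalg.inv(L)
    M = Linv @ Rb @ Linv.T
    lam, U = np.linalg.eigh((M + M.T) / 2)        # symmetrize against round-off
    order = np.argsort(lam)[::-1]                 # strongest component first
    V = Linv.T @ U[:, order]                      # map back to the original space
    isc = lam[order] / (N - 1)                    # assumed normalization, see lead-in
    return V, isc

# Synthetic example (assumed data): one source shared by all repetitions plus white noise.
rng = np.random.default_rng(0)
N, D, T = 5, 4, 2000
a = np.array([2.0, 1.0, -1.0, 0.4])               # hypothetical mixing vector
s = rng.standard_normal(T)                         # shared source
X = a[None, :, None] * s[None, None, :] + rng.standard_normal((N, D, T))
V, isc = corrca(X)
print(isc)
```

In this toy setting only the first component carries a shared signal, so its value in `isc` should be well above the others, and the first column of `V` should point roughly along `a`.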

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1801.08881 [stat.ML]
	(or arXiv:1801.08881v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1801.08881

Submission history

From: Jacek Dmochowski
[v1] Fri, 26 Jan 2018 16:12:07 UTC (2,832 KB)
[v2] Sat, 10 Feb 2018 19:19:10 UTC (2,832 KB)
[v3] Mon, 7 May 2018 16:05:30 UTC (2,757 KB)
[v4] Mon, 10 Sep 2018 22:02:19 UTC (3,106 KB)
[v5] Sun, 20 Jan 2019 21:15:39 UTC (5,506 KB)
