MREC: a fast and versatile framework for aligning and matching point clouds with applications to single cell molecular data

Blumberg, Andrew J.; Carriere, Mathieu; Mandell, Michael A.; Rabadan, Raul; Villar, Soledad

arXiv:2001.01666 (stat)

[Submitted on 6 Jan 2020 (v1), last revised 20 Feb 2020 (this version, v3)]

Abstract: Comparing and aligning large data sets is a pervasive problem that arises across many knowledge domains. We introduce and study MREC, a recursive decomposition algorithm for computing matchings between data sets. The basic idea is to partition the data, match the partitions, and then recursively match the points within each pair of identified partitions. The matching itself is done using black-box matching procedures that are too expensive to run on the entire data set. Using an absolute measure of the quality of a matching, the framework supports optimization over parameters including partitioning procedures and matching algorithms. By design, MREC can be applied to extremely large data sets. We analyze the procedure to describe when we can expect it to work well, and we demonstrate its flexibility and power by applying it to a number of alignment problems arising in the analysis of single cell molecular data.
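The partition, match, and recurse idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. Everything named here is an assumption made for illustration: the functions `black_box_match`, `voronoi_cells`, and `mrec_sketch`, the parameters `k`, `threshold`, and `exact_limit`, the choice of random centers with nearest-center (Voronoi) cells as the partitioning procedure, and a brute-force Euclidean assignment as the stand-in for an expensive black-box matcher. It also assumes both point clouds live in the same Euclidean space, which the framework itself may not require.

```python
import itertools
import math
import random


def black_box_match(A, B, exact_limit=7):
    """Hypothetical stand-in for an expensive matcher.

    Returns, for each point of A, an index into B.
    """
    if len(A) <= len(B) <= exact_limit:
        # Exact minimum-cost injective assignment by brute force.
        # The cost is factorial, so this is only usable on tiny inputs.
        best = min(
            itertools.permutations(range(len(B)), len(A)),
            key=lambda perm: sum(math.dist(A[i], B[j]) for i, j in enumerate(perm)),
        )
        return list(best)
    # Assumption: otherwise fall back to nearest neighbour (many-to-one allowed).
    return [min(range(len(B)), key=lambda j: math.dist(a, B[j])) for a in A]


def voronoi_cells(points, indices, centers):
    """Group `indices` by nearest center; each center starts its own cell."""
    cells = {c: [c] for c in centers}
    for i in indices:
        if i not in cells:
            nearest = min(centers, key=lambda c: math.dist(points[i], points[c]))
            cells[nearest].append(i)
    return cells


def mrec_sketch(X, Y, xi=None, yi=None, k=3, threshold=6, rng=None, matching=None):
    """Recursively match indices of X to indices of Y (illustrative only).

    Requires 2 <= k <= threshold so that every cell shrinks and the
    centers can always be sampled.
    """
    if rng is None:
        rng = random.Random(0)
    xi = list(range(len(X))) if xi is None else xi
    yi = list(range(len(Y))) if yi is None else yi
    matching = {} if matching is None else matching

    if len(xi) <= threshold or len(yi) <= threshold:
        # Base case: one side is small, so call the black box directly.
        local = black_box_match([X[i] for i in xi], [Y[j] for j in yi])
        for i, j in zip(xi, local):
            matching[i] = yi[j]
        return matching

    # 1. Partition both clouds around k randomly chosen centers.
    cx = rng.sample(xi, k)
    cy = rng.sample(yi, k)
    cells_x = voronoi_cells(X, xi, cx)
    cells_y = voronoi_cells(Y, yi, cy)

    # 2. Match the partitions by matching their centers with the black box.
    center_match = black_box_match([X[c] for c in cx], [Y[c] for c in cy])

    # 3. Recurse inside each pair of matched cells.
    for a, b in enumerate(center_match):
        mrec_sketch(X, Y, cells_x[cx[a]], cells_y[cy[b]], k, threshold, rng, matching)
    return matching


# Usage: match a random 2-D cloud to a slightly perturbed copy of itself.
data_rng = random.Random(1)
X = [(data_rng.random(), data_rng.random()) for _ in range(200)]
Y = [(x + data_rng.gauss(0, 0.01), y + data_rng.gauss(0, 0.01)) for x, y in X]
matching = mrec_sketch(X, Y)
```

In this sketch the black box only ever sees the `k` centers or a small cell, never the full clouds, which is the point of the decomposition. The resulting map need not be one-to-one, and its quality depends on the random centers, so in practice one would presumably score several runs and keep the best, in the spirit of the parameter optimization the abstract mentions.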

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Genomics (q-bio.GN)
Cite as:	arXiv:2001.01666 [stat.ML]
	(or arXiv:2001.01666v3 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2001.01666

Submission history

From: Mathieu Carrière
[v1] Mon, 6 Jan 2020 17:02:16 UTC (684 KB)
[v2] Tue, 7 Jan 2020 06:26:35 UTC (684 KB)
[v3] Thu, 20 Feb 2020 22:17:02 UTC (1,460 KB)
