Bayesian Boolean Matrix Factorisation

Rukat, Tammo; Holmes, Chris C.; Titsias, Michalis K.; Yau, Christopher

arXiv:1702.06166v1 (stat)

[Submitted on 20 Feb 2017 (this version), latest version 25 Feb 2017 (v2)]

Abstract: Boolean matrix factorisation (BooMF) infers interpretable decompositions of a binary data matrix into a pair of low-rank binary matrices: one containing meaningful patterns, the other quantifying how the observations can be expressed as a combination of these patterns. We introduce the OrMachine, a probabilistic generative model for BooMF, and derive a Metropolised Gibbs sampler that facilitates efficient parallel posterior inference. As we show on simulated and real-world data, our method outperforms all existing approaches to Boolean matrix factorisation and completion. This is the first method to provide full posterior inference for BooMF, which is relevant in applications such as controlling false positive rates in collaborative filtering and, crucially, improves the interpretability of the inferred patterns. The proposed algorithm scales to large datasets, as we demonstrate by analysing single-cell gene expression data in 1.3 million mouse brain cells across 11,000 genes on commodity hardware.
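
To make the decomposition described in the abstract concrete, the following is a minimal sketch of the noise-free Boolean matrix product only, not of the OrMachine model or its sampler. The names Z (how observations combine patterns), U (the patterns), and boolean_product are illustrative assumptions and do not come from the paper; the toy matrices are made up.

```python
import numpy as np

def boolean_product(Z, U):
    """Boolean matrix product: X[n, d] = OR over l of (Z[n, l] AND U[l, d])."""
    # The integer product counts how many patterns switch an entry on;
    # any count above zero means the logical OR is true.
    return (Z.astype(int) @ U.astype(int) > 0).astype(int)

# Hypothetical toy data: 3 observations, 2 patterns, 4 features.
Z = np.array([[1, 0],
              [0, 1],
              [1, 1]])
U = np.array([[1, 1, 0, 0],
              [0, 1, 1, 0]])

X = boolean_product(Z, U)
print(X)
```

The third observation uses both patterns, so its row is the element-wise OR of the two pattern rows. Where the patterns overlap, the entry stays 1 rather than summing to 2, which is what distinguishes a Boolean factorisation from an ordinary low-rank one.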

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG); Numerical Analysis (math.NA); Genomics (q-bio.GN); Quantitative Methods (q-bio.QM); Methodology (stat.ME)
Cite as:	arXiv:1702.06166 [stat.ML]
	(or arXiv:1702.06166v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1702.06166

Submission history

From: Tammo Rukat
[v1] Mon, 20 Feb 2017 20:31:39 UTC (355 KB)
[v2] Sat, 25 Feb 2017 14:17:44 UTC (414 KB)