Sketching for Latent Dirichlet-Categorical Models

Tassarotti, Joseph; Tristan, Jean-Baptiste; Wick, Michael

Computer Science > Machine Learning

arXiv:1810.01400 (cs)

[Submitted on 2 Oct 2018]

Title:Sketching for Latent Dirichlet-Categorical Models

Authors:Joseph Tassarotti, Jean-Baptiste Tristan, Michael Wick

View PDF

Abstract:Recent work has explored transforming data sets into smaller, approximate summaries in order to scale Bayesian inference. We examine a related problem in which the parameters of a Bayesian model are very large and expensive to store in memory, and propose more compact representations of parameter values that can be used during inference. We focus on a class of graphical models that we refer to as latent Dirichlet-Categorical models, and show how a combination of two sketching algorithms known as count-min sketch and approximate counters provide an efficient representation for them. We show that this sketch combination -- which, despite having been used before in NLP applications, has not been previously analyzed -- enjoys desirable properties. We prove that for this class of models, when the sketches are used during Markov Chain Monte Carlo inference, the equilibrium of sketched MCMC converges to that of the exact chain as sketch parameters are tuned to reduce the error rate.

Comments:	20 pages
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1810.01400 [cs.LG]
	(or arXiv:1810.01400v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1810.01400

Submission history

From: Joseph Tassarotti [view email]
[v1] Tue, 2 Oct 2018 17:47:04 UTC (1,802 KB)

Computer Science > Machine Learning

Title:Sketching for Latent Dirichlet-Categorical Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Sketching for Latent Dirichlet-Categorical Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators