Subsampling Sequential Monte Carlo for Static Bayesian Models

Gunawan, David; Dang, Khue-Dung; Quiroz, Matias; Kohn, Robert; Tran, Minh-Ngoc

Statistics > Computation

arXiv:1805.03317 (stat)

[Submitted on 8 May 2018 (v1), last revised 24 Mar 2020 (this version, v3)]

Title:Subsampling Sequential Monte Carlo for Static Bayesian Models

Authors:David Gunawan, Khue-Dung Dang, Matias Quiroz, Robert Kohn, Minh-Ngoc Tran

View PDF

Abstract:We show how to speed up Sequential Monte Carlo (SMC) for Bayesian inference in large data problems by data subsampling. SMC sequentially updates a cloud of particles through a sequence of distributions, beginning with a distribution that is easy to sample from such as the prior and ending with the posterior distribution. Each update of the particle cloud consists of three steps: reweighting, resampling, and moving. In the move step, each particle is moved using a Markov kernel; this is typically the most computationally expensive part, particularly when the dataset is large. It is crucial to have an efficient move step to ensure particle diversity. Our article makes two important contributions. First, in order to speed up the SMC computation, we use an approximately unbiased and efficient annealed likelihood estimator based on data subsampling. The subsampling approach is more memory efficient than the corresponding full data SMC, which is an advantage for parallel computation. Second, we use a Metropolis within Gibbs kernel with two conditional updates. A Hamiltonian Monte Carlo update makes distant moves for the model parameters, and a block pseudo-marginal proposal is used for the particles corresponding to the auxiliary variables for the data subsampling. We demonstrate both the usefulness and limitations of the methodology for estimating four generalized linear models and a generalized additive model with large datasets.

Subjects:	Computation (stat.CO); Methodology (stat.ME); Machine Learning (stat.ML)
Cite as:	arXiv:1805.03317 [stat.CO]
	(or arXiv:1805.03317v3 [stat.CO] for this version)
	https://doi.org/10.48550/arXiv.1805.03317

Submission history

From: Matias Quiroz [view email]
[v1] Tue, 8 May 2018 23:17:01 UTC (689 KB)
[v2] Tue, 23 Apr 2019 08:12:12 UTC (830 KB)
[v3] Tue, 24 Mar 2020 10:36:25 UTC (1,584 KB)

Statistics > Computation

Title:Subsampling Sequential Monte Carlo for Static Bayesian Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Computation

Title:Subsampling Sequential Monte Carlo for Static Bayesian Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators