Preconditioning for Scalable Gaussian Process Hyperparameter Optimization

Wenger, Jonathan; Pleiss, Geoff; Hennig, Philipp; Cunningham, John P.; Gardner, Jacob R.

Computer Science > Machine Learning

arXiv:2107.00243 (cs)

[Submitted on 1 Jul 2021 (v1), last revised 18 Jun 2022 (this version, v5)]

Title:Preconditioning for Scalable Gaussian Process Hyperparameter Optimization

Authors:Jonathan Wenger, Geoff Pleiss, Philipp Hennig, John P. Cunningham, Jacob R. Gardner

View PDF

Abstract:Gaussian process hyperparameter optimization requires linear solves with, and log-determinants of, large kernel matrices. Iterative numerical techniques are becoming popular to scale to larger datasets, relying on the conjugate gradient method (CG) for the linear solves and stochastic trace estimation for the log-determinant. This work introduces new algorithmic and theoretical insights for preconditioning these computations. While preconditioning is well understood in the context of CG, we demonstrate that it can also accelerate convergence and reduce variance of the estimates for the log-determinant and its derivative. We prove general probabilistic error bounds for the preconditioned computation of the log-determinant, log-marginal likelihood and its derivatives. Additionally, we derive specific rates for a range of kernel-preconditioner combinations, showing that up to exponential convergence can be achieved. Our theoretical results enable provably efficient optimization of kernel hyperparameters, which we validate empirically on large-scale benchmark problems. There our approach accelerates training by up to an order of magnitude.

Comments:	International Conference on Machine Learning (ICML)
Subjects:	Machine Learning (cs.LG); Numerical Analysis (math.NA)
Cite as:	arXiv:2107.00243 [cs.LG]
	(or arXiv:2107.00243v5 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2107.00243

Submission history

From: Jonathan Wenger [view email]
[v1] Thu, 1 Jul 2021 06:43:11 UTC (3,049 KB)
[v2] Fri, 28 Jan 2022 19:17:26 UTC (1,900 KB)
[v3] Tue, 1 Feb 2022 15:38:06 UTC (2,409 KB)
[v4] Sun, 12 Jun 2022 22:48:12 UTC (1,901 KB)
[v5] Sat, 18 Jun 2022 21:38:55 UTC (1,901 KB)

Computer Science > Machine Learning

Title:Preconditioning for Scalable Gaussian Process Hyperparameter Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Preconditioning for Scalable Gaussian Process Hyperparameter Optimization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators