A Robust Phased Elimination Algorithm for Corruption-Tolerant Gaussian Process Bandits

Bogunovic, Ilija; Li, Zihan; Krause, Andreas; Scarlett, Jonathan

Statistics > Machine Learning

arXiv:2202.01850 (stat)

[Submitted on 3 Feb 2022 (v1), last revised 28 Mar 2022 (this version, v2)]

Abstract: We consider the sequential optimization of an unknown, continuous, and expensive-to-evaluate reward function, from noisy and adversarially corrupted observed rewards. When the corruption attacks are subject to a suitable budget $C$ and the function lives in a Reproducing Kernel Hilbert Space (RKHS), the problem can be posed as corrupted Gaussian process (GP) bandit optimization. We propose a novel robust elimination-type algorithm that runs in epochs, combines exploration with infrequent switching to select a small subset of actions, and plays each action for multiple time instants. Our algorithm, Robust GP Phased Elimination (RGP-PE), successfully balances robustness to corruptions with exploration and exploitation such that its performance degrades minimally in the presence (or absence) of adversarial corruptions. When $T$ is the number of samples and $\gamma_T$ is the maximal information gain, the corruption-dependent term in our regret bound is $O(C \gamma_T^{3/2})$, which is significantly tighter than the existing $O(C \sqrt{T \gamma_T})$ for several commonly considered kernels. We perform the first empirical study of robustness in the corrupted GP bandit setting, and show that our algorithm is robust against a variety of adversarial attacks.
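
To make the comparison between the two corruption-dependent terms concrete, the following is a short worked check that ignores constants and logarithmic factors. The squared-exponential growth rate used in the second step is a standard result from the GP bandit literature and is an assumption here, since the abstract does not state it; $d$ denotes the input dimension, a symbol that is likewise not part of the abstract.

```latex
% When is the new term no larger than the existing one? (C > 0, \gamma_T > 0)
\[
C\,\gamma_T^{3/2} \;\le\; C\sqrt{T\,\gamma_T}
\;\iff\; \gamma_T^{3/2} \;\le\; T^{1/2}\,\gamma_T^{1/2}
\;\iff\; \gamma_T \;\le\; \sqrt{T}.
\]
% Assumed example (not from the abstract): squared-exponential kernel in
% dimension d, for which \gamma_T = O((\log T)^{d+1}).
\[
C\,\gamma_T^{3/2} = O\!\big(C\,(\log T)^{3(d+1)/2}\big),
\qquad
C\sqrt{T\,\gamma_T} = O\!\big(C\,\sqrt{T}\,(\log T)^{(d+1)/2}\big).
\]
```

Under that assumption the new term grows only polylogarithmically in $T$, whereas the existing term grows like $\sqrt{T}$ up to logarithmic factors. More generally, the improvement holds whenever $\gamma_T$ grows more slowly than $\sqrt{T}$.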

Comments:	Added references
Subjects:	Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2202.01850 [stat.ML]
	(or arXiv:2202.01850v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2202.01850

Submission history

From: Ilija Bogunovic
[v1] Thu, 3 Feb 2022 21:19:36 UTC (1,469 KB)
[v2] Mon, 28 Mar 2022 21:57:13 UTC (1,482 KB)
