Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits

Syrgkanis, Vasilis; Luo, Haipeng; Krishnamurthy, Akshay; Schapire, Robert E.

Computer Science > Machine Learning

arXiv:1606.00313 (cs)

[Submitted on 1 Jun 2016]

Title:Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits

Authors:Vasilis Syrgkanis, Haipeng Luo, Akshay Krishnamurthy, Robert E. Schapire

View PDF

Abstract:We give an oracle-based algorithm for the adversarial contextual bandit problem, where either contexts are drawn i.i.d. or the sequence of contexts is known a priori, but where the losses are picked adversarially. Our algorithm is computationally efficient, assuming access to an offline optimization oracle, and enjoys a regret of order $O((KT)^{\frac{2}{3}}(\log N)^{\frac{1}{3}})$, where $K$ is the number of actions, $T$ is the number of iterations and $N$ is the number of baseline policies. Our result is the first to break the $O(T^{\frac{3}{4}})$ barrier that is achieved by recently introduced algorithms. Breaking this barrier was left as a major open problem. Our analysis is based on the recent relaxation based approach of (Rakhlin and Sridharan, 2016).

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:1606.00313 [cs.LG]
	(or arXiv:1606.00313v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1606.00313

Submission history

From: Vasilis Syrgkanis [view email]
[v1] Wed, 1 Jun 2016 14:47:19 UTC (23 KB)

Computer Science > Machine Learning

Title:Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Improved Regret Bounds for Oracle-Based Adversarial Contextual Bandits

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators