Online Multiclass Boosting with Bandit Feedback

Zhang, Daniel T.; Jung, Young Hun; Tewari, Ambuj

Statistics > Machine Learning

arXiv:1810.05290 (stat)

[Submitted on 11 Oct 2018 (v1), last revised 25 Feb 2019 (this version, v2)]

Title:Online Multiclass Boosting with Bandit Feedback

Authors:Daniel T. Zhang, Young Hun Jung, Ambuj Tewari

View PDF

Abstract:We present online boosting algorithms for multiclass classification with bandit feedback, where the learner only receives feedback about the correctness of its prediction. We propose an unbiased estimate of the loss using a randomized prediction, allowing the model to update its weak learners with limited information. Using the unbiased estimate, we extend two full information boosting algorithms (Jung et al., 2017) to the bandit setting. We prove that the asymptotic error bounds of the bandit algorithms exactly match their full information counterparts. The cost of restricted feedback is reflected in the larger sample complexity. Experimental results also support our theoretical findings, and performance of the proposed models is comparable to that of an existing bandit boosting algorithm, which is limited to use binary weak learners.

Comments:	Accepted in AISTATS 2019
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1810.05290 [stat.ML]
	(or arXiv:1810.05290v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1810.05290

Submission history

From: Young Hun Jung [view email]
[v1] Thu, 11 Oct 2018 23:47:21 UTC (228 KB)
[v2] Mon, 25 Feb 2019 05:28:45 UTC (228 KB)

Statistics > Machine Learning

Title:Online Multiclass Boosting with Bandit Feedback

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Online Multiclass Boosting with Bandit Feedback

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators