Cost-aware Cascading Bandits

Zhou, Ruida; Gan, Chao; Yan, Jing; Shen, Cong


arXiv:1805.08638 (cs)

[Submitted on 22 May 2018]


Abstract: In this paper, we propose a cost-aware cascading bandits model, a new variant of multi-armed bandits with cascading feedback, by considering the random cost of pulling arms. In each step, the learning agent chooses an ordered list of items and examines them sequentially until a certain stopping condition is satisfied. Our objective is then to maximize the expected net reward in each step, i.e., the reward obtained in each step minus the total cost incurred in examining the items, by deciding the ordered list of items, as well as when to stop examination. We study both the offline and online settings, depending on whether the state and cost statistics of the items are known beforehand. For the offline setting, we show that the Unit Cost Ranking with Threshold 1 (UCR-T1) policy is optimal. For the online setting, we propose a Cost-aware Cascading Upper Confidence Bound (CC-UCB) algorithm, and show that the cumulative regret scales in O(log T). We also provide a lower bound for all α-consistent policies, which scales in Ω(log T) and matches our upper bound. The performance of the CC-UCB algorithm is evaluated with both synthetic and real-world data.
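The offline setting described in the abstract can be illustrated with a small sketch. The abstract does not spell out the stopping condition or the exact ranking rule, so everything below is an assumption made for illustration: each item i has a Bernoulli state with mean theta[i] and a mean examination cost cost[i], the agent stops at the first item whose state is 1 and then collects a reward of 1, and the threshold policy ranks items by cost[i] / theta[i] in ascending order while keeping only items whose ratio is below 1. The function names are hypothetical and are not taken from the paper.

```python
from itertools import permutations


def expected_net_reward(order, theta, cost):
    """Expected reward minus expected cost for examining `order` in sequence.

    Assumes independent Bernoulli states and a stop at the first state equal to 1.
    """
    total = 0.0
    p_reach = 1.0  # probability that every earlier item had state 0
    for i in order:
        # Reaching item i costs cost[i] on average and pays 1 with prob. theta[i].
        total += p_reach * (theta[i] - cost[i])
        p_reach *= 1.0 - theta[i]
    return total


def unit_cost_ranking(theta, cost, threshold=1.0):
    """Rank items by cost/theta (ascending), dropping ratios >= threshold.

    This is one reading of "Unit Cost Ranking with Threshold 1", not a
    transcription of the paper's definition.
    """
    ratios = {i: cost[i] / theta[i] for i in range(len(theta)) if theta[i] > 0}
    kept = [i for i, r in ratios.items() if r < threshold]
    return sorted(kept, key=lambda i: ratios[i])


def brute_force_best(theta, cost):
    """Best expected net reward over all ordered sub-lists (small inputs only)."""
    n = len(theta)
    best = 0.0  # the empty list earns zero
    for k in range(1, n + 1):
        for order in permutations(range(n), k):
            best = max(best, expected_net_reward(order, theta, cost))
    return best


if __name__ == "__main__":
    theta = [0.5, 0.2, 0.8, 0.1]    # illustrative success probabilities
    cost = [0.1, 0.3, 0.4, 0.08]    # illustrative mean costs
    order = unit_cost_ranking(theta, cost)
    print(order, expected_net_reward(order, theta, cost))
    print(brute_force_best(theta, cost))
```

On this toy instance the thresholded ranking attains the same value as exhaustive search over all ordered sub-lists, which is consistent with the optimality claim in the abstract under the stated assumptions. The online CC-UCB algorithm is not sketched here because the abstract gives no details of its index.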

Comments:	11 pages, 2 figures, IJCAI 2018
Subjects:	Machine Learning (cs.LG); Information Theory (cs.IT); Machine Learning (stat.ML)
Cite as:	arXiv:1805.08638 [cs.LG]
	(or arXiv:1805.08638v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1805.08638

Submission history

From: Ruida Zhou
[v1] Tue, 22 May 2018 14:39:30 UTC (88 KB)
