On No-Sensing Adversarial Multi-player Multi-armed Bandits with Collision Communications

Shi, Chengshuai; Shen, Cong

Computer Science > Information Theory

arXiv:2011.01090v1 (cs)

[Submitted on 2 Nov 2020 (this version), latest version 24 Apr 2021 (v2)]

Title:On No-Sensing Adversarial Multi-player Multi-armed Bandits with Collision Communications

Authors:Chengshuai Shi, Cong Shen

View PDF

Abstract:We study the notoriously difficult no-sensing adversarial multi-player multi-armed bandits (MP-MAB) problem from a new perspective. Instead of focusing on the hardness of multiple players, we introduce a new dimension of hardness, called attackability. All adversaries can be categorized based on the attackability and we introduce Adversary-Adaptive Collision-Communication (A2C2), a family of algorithms with forced-collision communication among players. Both attackability-aware and unaware settings are studied, and information-theoretic tools of the Z-channel model and error-correction coding are utilized to address the challenge of implicit communication without collision information in an adversarial environment. For the more challenging attackability-unaware problem, we propose a simple method to estimate the attackability enabled by a novel error-detection repetition code and randomized communication for synchronization. Theoretical analysis proves that asymptotic attackability-dependent sublinear regret can be achieved, with or without knowing the attackability. In particular, the asymptotic regret does not have an exponential dependence on the number of players, revealing a fundamental tradeoff between the two dimensions of hardness in this problem.

Comments:	35 pages, 5 figures
Subjects:	Information Theory (cs.IT); Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2011.01090 [cs.IT]
	(or arXiv:2011.01090v1 [cs.IT] for this version)
	https://doi.org/10.48550/arXiv.2011.01090

Submission history

From: Chengshuai Shi [view email]
[v1] Mon, 2 Nov 2020 16:21:18 UTC (1,703 KB)
[v2] Sat, 24 Apr 2021 18:36:25 UTC (1,371 KB)

Computer Science > Information Theory

Title:On No-Sensing Adversarial Multi-player Multi-armed Bandits with Collision Communications

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Theory

Title:On No-Sensing Adversarial Multi-player Multi-armed Bandits with Collision Communications

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators