The Power of Populations in Decentralized Bandits

Lazarsfeld, John; Alistarh, Dan

Computer Science > Machine Learning

arXiv:2306.08670v3 (cs)

[Submitted on 14 Jun 2023 (v1), revised 1 Feb 2024 (this version, v3), latest version 8 Jul 2024 (v4)]

Title:The Power of Populations in Decentralized Bandits

Authors:John Lazarsfeld, Dan Alistarh

View PDF HTML (experimental)

Abstract:We study a cooperative multi-agent bandit setting in the distributed GOSSIP model: in every round, each of $n$ agents chooses an action from a common set, observes the action's corresponding reward, and subsequently exchanges information with a single randomly chosen neighbor, which informs its policy in the next round. We introduce and analyze several families of fully-decentralized local algorithms in this setting under the constraint that each agent has only constant memory. We highlight a connection between the global evolution of such decentralized algorithms and a new class of "zero-sum" multiplicative weights update methods, and we develop a general framework for analyzing the population-level regret of these natural protocols. Using this framework, we derive sublinear regret bounds for both stationary and adversarial reward settings. Moreover, we show that these simple local algorithms can approximately optimize convex functions over the simplex, assuming that the reward distributions are generated from a stochastic gradient oracle.

Subjects:	Machine Learning (cs.LG); Distributed, Parallel, and Cluster Computing (cs.DC); Data Structures and Algorithms (cs.DS)
Cite as:	arXiv:2306.08670 [cs.LG]
	(or arXiv:2306.08670v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.08670

Submission history

From: John Lazarsfeld [view email]
[v1] Wed, 14 Jun 2023 17:59:15 UTC (509 KB)
[v2] Thu, 19 Oct 2023 15:19:05 UTC (512 KB)
[v3] Thu, 1 Feb 2024 18:54:21 UTC (581 KB)
[v4] Mon, 8 Jul 2024 15:56:39 UTC (199 KB)

Computer Science > Machine Learning

Title:The Power of Populations in Decentralized Bandits

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The Power of Populations in Decentralized Bandits

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators