Heterogeneous Multi-Agent Bandits with Parsimonious Hints

Mirfakhar, Amirmahdi; Wang, Xuchuang; Zuo, Jinhang; Zick, Yair; Hajiesmaili, Mohammad

Computer Science > Machine Learning

arXiv:2502.16128 (cs)

[Submitted on 22 Feb 2025]

Title:Heterogeneous Multi-Agent Bandits with Parsimonious Hints

Authors:Amirmahdi Mirfakhar, Xuchuang Wang, Jinhang Zuo, Yair Zick, Mohammad Hajiesmaili

View PDF HTML (experimental)

Abstract:We study a hinted heterogeneous multi-agent multi-armed bandits problem (HMA2B), where agents can query low-cost observations (hints) in addition to pulling arms. In this framework, each of the $M$ agents has a unique reward distribution over $K$ arms, and in $T$ rounds, they can observe the reward of the arm they pull only if no other agent pulls that arm. The goal is to maximize the total utility by querying the minimal necessary hints without pulling arms, achieving time-independent regret. We study HMA2B in both centralized and decentralized setups. Our main centralized algorithm, GP-HCLA, which is an extension of HCLA, uses a central decision-maker for arm-pulling and hint queries, achieving $O(M^4K)$ regret with $O(MK\log T)$ adaptive hints. In decentralized setups, we propose two algorithms, HD-ETC and EBHD-ETC, that allow agents to choose actions independently through collision-based communication and query hints uniformly until stopping, yielding $O(M^3K^2)$ regret with $O(M^3K\log T)$ hints, where the former requires knowledge of the minimum gap and the latter does not. Finally, we establish lower bounds to prove the optimality of our results and verify them through numerical simulations.

Comments:	Accepted at AAAI-2025
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Multiagent Systems (cs.MA)
Cite as:	arXiv:2502.16128 [cs.LG]
	(or arXiv:2502.16128v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.16128

Submission history

From: Amirmahdi Mirfakhar [view email]
[v1] Sat, 22 Feb 2025 07:46:41 UTC (29,865 KB)

Computer Science > Machine Learning

Title:Heterogeneous Multi-Agent Bandits with Parsimonious Hints

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Heterogeneous Multi-Agent Bandits with Parsimonious Hints

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators