Simultaneously Achieving Group Exposure Fairness and Within-Group Meritocracy in Stochastic Bandits

Pokhriyal, Subham; Jain, Shweta; Ghalme, Ganesh; Dhamal, Swapnil; Gujar, Sujit

arXiv:2402.05575 (cs)

[Submitted on 8 Feb 2024]

Authors: Subham Pokhriyal, Shweta Jain, Ganesh Ghalme, Swapnil Dhamal, Sujit Gujar

Abstract: Existing approaches to fairness in stochastic multi-armed bandits (MAB) primarily focus on exposure guarantees for individual arms. When arms are naturally grouped by certain attribute(s), we propose Bi-Level Fairness, which considers two levels of fairness. At the first level, Bi-Level Fairness guarantees a certain minimum exposure to each group. To address the unbalanced allocation of pulls to individual arms within a group, we consider meritocratic fairness at the second level, which ensures that each arm is pulled according to its merit within the group. Our work shows that a UCB-based algorithm can be adapted to achieve Bi-Level Fairness by (i) providing anytime Group Exposure Fairness guarantees and (ii) ensuring individual-level Meritocratic Fairness within each group. We first show that the regret bound can be decomposed into two components: (a) regret due to anytime group exposure fairness and (b) regret due to meritocratic fairness within each group. Our proposed algorithm, BF-UCB, balances these two regrets optimally to achieve an upper bound of $O(\sqrt{T})$ on regret, $T$ being the stopping time. Using simulated experiments, we further show that BF-UCB achieves sub-linear regret, provides better group and individual exposure guarantees than existing algorithms, and does not incur a significant drop in reward relative to the UCB algorithm, which imposes no fairness constraint.
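To make the two levels described in the abstract concrete, here is a minimal Python sketch of a bi-level selection loop. It is not the paper's BF-UCB; the abstract does not give the algorithm's details, so everything specific here is an assumption made for illustration. That includes the function names (`pick_group`, `pick_arm`, `run_sketch`), the `min_share` parameter, the deficit rule used for group exposure, the choice of the optimistic (UCB) estimate as the merit function, and the Bernoulli rewards.

```python
import math
import random


def ucb_index(mean, pulls, t):
    """UCB1-style optimistic estimate for rewards in [0, 1] (requires pulls > 0)."""
    return mean + math.sqrt(2.0 * math.log(t) / pulls)


def pick_group(groups, min_share, group_pulls, means, pulls, t):
    """Level 1 (assumed rule): serve the group furthest below its exposure floor."""
    deficit = {g: min_share[g] * t - group_pulls[g] for g in groups}
    lagging = max(deficit, key=deficit.get)
    if deficit[lagging] > 0:
        return lagging

    # No group is behind its floor: pick the group whose best arm looks most promising.
    def best_index(g):
        return max(
            math.inf if pulls[a] == 0 else ucb_index(means[a], pulls[a], t)
            for a in groups[g]
        )

    return max(groups, key=best_index)


def pick_arm(arms, means, pulls, t, rng):
    """Level 2 (assumed rule): sample an arm with probability proportional to merit."""
    untried = [a for a in arms if pulls[a] == 0]
    if untried:
        return untried[0]  # try every arm once before sampling
    # Assumed merit function: the optimistic estimate itself, which is positive here.
    weights = [ucb_index(means[a], pulls[a], t) for a in arms]
    return rng.choices(arms, weights=weights, k=1)[0]


def run_sketch(groups, true_means, min_share, horizon, seed=0):
    """Simulate the two-level loop; returns (pulls per group, pulls per arm)."""
    rng = random.Random(seed)
    arms = [a for members in groups.values() for a in members]
    means = {a: 0.0 for a in arms}
    pulls = {a: 0 for a in arms}
    group_pulls = {g: 0 for g in groups}
    for t in range(1, horizon + 1):
        g = pick_group(groups, min_share, group_pulls, means, pulls, t)
        a = pick_arm(groups[g], means, pulls, t, rng)
        reward = 1.0 if rng.random() < true_means[a] else 0.0  # Bernoulli reward
        pulls[a] += 1
        group_pulls[g] += 1
        means[a] += (reward - means[a]) / pulls[a]  # running average
    return group_pulls, pulls
```

A call such as `run_sketch({"A": ["a1", "a2"], "B": ["b1", "b2"]}, {"a1": 0.9, "a2": 0.6, "b1": 0.3, "b2": 0.2}, {"A": 0.3, "B": 0.3}, horizon=2000)` simulates two groups with a 30% exposure floor each. In this sketch the deficit check runs every round, so each group's pull count stays close to its floor at every step rather than only at the end of the horizon. Within a group, arms are sampled in proportion to their estimated merit instead of always pulling the single best arm.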

Comments:	Accepted in AAMAS 2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computers and Society (cs.CY); Multiagent Systems (cs.MA)
Cite as:	arXiv:2402.05575 [cs.LG]
	(or arXiv:2402.05575v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.05575

Submission history

From: Shweta Jain
[v1] Thu, 8 Feb 2024 11:19:58 UTC (1,224 KB)
