Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets

Min, Yifei; Wang, Tianhao; Xu, Ruitu; Wang, Zhaoran; Jordan, Michael I.; Yang, Zhuoran

Computer Science > Machine Learning

arXiv:2203.03684 (cs)

[Submitted on 7 Mar 2022]

Title:Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets

Authors:Yifei Min, Tianhao Wang, Ruitu Xu, Zhaoran Wang, Michael I. Jordan, Zhuoran Yang

View PDF

Abstract:We study a Markov matching market involving a planner and a set of strategic agents on the two sides of the market. At each step, the agents are presented with a dynamical context, where the contexts determine the utilities. The planner controls the transition of the contexts to maximize the cumulative social welfare, while the agents aim to find a myopic stable matching at each step. Such a setting captures a range of applications including ridesharing platforms. We formalize the problem by proposing a reinforcement learning framework that integrates optimistic value iteration with maximum weight matching. The proposed algorithm addresses the coupled challenges of sequential exploration, matching stability, and function approximation. We prove that the algorithm achieves sublinear regret.

Comments:	40 pages
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Science and Game Theory (cs.GT); Statistics Theory (math.ST)
Cite as:	arXiv:2203.03684 [cs.LG]
	(or arXiv:2203.03684v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2203.03684

Submission history

From: Zhuoran Yang [view email]
[v1] Mon, 7 Mar 2022 19:51:25 UTC (559 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2022-03

Change to browse by:

cs
cs.AI
cs.GT
math
math.ST
stat
stat.TH

References & Citations

export BibTeX citation

Computer Science > Machine Learning

Title:Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators