The Sample-Communication Complexity Trade-off in Federated Q-Learning

Salgia, Sudeep; Chi, Yuejie

Computer Science > Machine Learning

arXiv:2408.16981 (cs)

[Submitted on 30 Aug 2024 (v1), last revised 29 Oct 2024 (this version, v2)]

Title:The Sample-Communication Complexity Trade-off in Federated Q-Learning

Authors:Sudeep Salgia, Yuejie Chi

View PDF HTML (experimental)

Abstract:We consider the problem of federated Q-learning, where $M$ agents aim to collaboratively learn the optimal Q-function of an unknown infinite-horizon Markov decision process with finite state and action spaces. We investigate the trade-off between sample and communication complexities for the widely used class of intermittent communication algorithms. We first establish the converse result, where it is shown that a federated Q-learning algorithm that offers any speedup with respect to the number of agents in the per-agent sample complexity needs to incur a communication cost of at least an order of $\frac{1}{1-\gamma}$ up to logarithmic factors, where $\gamma$ is the discount factor. We also propose a new algorithm, called Fed-DVR-Q, which is the first federated Q-learning algorithm to simultaneously achieve order-optimal sample and communication complexities. Thus, together these results provide a complete characterization of the sample-communication complexity trade-off in federated Q-learning.

Comments:	Accepted to NeurIPS 2024
Subjects:	Machine Learning (cs.LG); Optimization and Control (math.OC); Machine Learning (stat.ML)
Cite as:	arXiv:2408.16981 [cs.LG]
	(or arXiv:2408.16981v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2408.16981

Submission history

From: Sudeep Salgia [view email]
[v1] Fri, 30 Aug 2024 03:03:03 UTC (152 KB)
[v2] Tue, 29 Oct 2024 20:37:04 UTC (562 KB)

Computer Science > Machine Learning

Title:The Sample-Communication Complexity Trade-off in Federated Q-Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:The Sample-Communication Complexity Trade-off in Federated Q-Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators