Exact Formulas for Finite-Time Estimation Errors of Decentralized Temporal Difference Learning with Linear Function Approximation

Guo, Xingang; Hu, Bin

arXiv:2204.09801 (cs)

[Submitted on 20 Apr 2022]

Abstract: In this paper, we consider the policy evaluation problem in multi-agent reinforcement learning (MARL) and derive exact closed-form formulas for the finite-time mean-squared estimation errors of decentralized temporal difference (TD) learning with linear function approximation. Our analysis hinges upon the fact that the decentralized TD learning method can be viewed as a Markov jump linear system (MJLS). Then standard MJLS theory can be applied to quantify the mean and covariance matrix of the estimation error of the decentralized TD method at every time step. Various implications of our exact formulas on the algorithm performance are also discussed. An interesting finding is that under a necessary and sufficient stability condition, the mean-squared TD estimation error will converge to an exact limit at a specific exponential rate.
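To make the MJLS viewpoint in the abstract concrete, here is a minimal sketch of the standard moment recursions for a generic scalar Markov jump linear system x_{k+1} = a[z_k] x_k + b[z_k], where z_k is a finite Markov chain. It is not the paper's formula for decentralized TD learning: the coefficients `a`, `b`, the transition matrix `P`, the initial distribution `pi0`, and the function names are all hypothetical choices made for illustration. The sketch propagates the mode-indexed moments q_i = E[x_k 1{z_k = i}] and Q_i = E[x_k^2 1{z_k = i}] exactly, then checks them against a brute-force enumeration of every mode path.

```python
import itertools
import math


def mjls_moments(a, b, P, pi0, x0, steps):
    """Exact E[x_k] and E[x_k^2] for x_{k+1} = a[z_k]*x_k + b[z_k] after `steps` steps."""
    n = len(a)
    pi = list(pi0)                       # pi[i] = P(z_k = i)
    q = [pi[i] * x0 for i in range(n)]   # q[i] = E[x_k * 1{z_k = i}]
    Q = [pi[i] * x0 * x0 for i in range(n)]  # Q[i] = E[x_k^2 * 1{z_k = i}]
    for _ in range(steps):
        # Moments of a[i]*x_k + b[i] restricted to the event {z_k = i}.
        m = [a[i] * q[i] + pi[i] * b[i] for i in range(n)]
        M = [a[i] ** 2 * Q[i] + 2 * a[i] * b[i] * q[i] + pi[i] * b[i] ** 2
             for i in range(n)]
        # Mix over the next mode j using P[i][j] = P(z_{k+1} = j | z_k = i).
        q = [sum(P[i][j] * m[i] for i in range(n)) for j in range(n)]
        Q = [sum(P[i][j] * M[i] for i in range(n)) for j in range(n)]
        pi = [sum(pi[i] * P[i][j] for i in range(n)) for j in range(n)]
    return sum(q), sum(Q)


def brute_force_moments(a, b, P, pi0, x0, steps):
    """Same moments by enumerating every mode path z_0..z_{steps-1} (steps >= 1)."""
    mean, second = 0.0, 0.0
    for path in itertools.product(range(len(a)), repeat=steps):
        prob = pi0[path[0]]
        for s, t in zip(path, path[1:]):
            prob *= P[s][t]
        x = x0
        for z in path:
            x = a[z] * x + b[z]
        mean += prob * x
        second += prob * x * x
    return mean, second


# Hypothetical two-mode example (values chosen only for illustration).
a = [0.9, 0.6]
b = [0.1, -0.2]
P = [[0.8, 0.2], [0.3, 0.7]]
pi0 = [0.5, 0.5]
x0 = 1.0
steps = 6

mean, second = mjls_moments(a, b, P, pi0, x0, steps)
bf_mean, bf_second = brute_force_moments(a, b, P, pi0, x0, steps)
print("recursion  :", mean, second)
print("brute force:", bf_mean, bf_second)
```

The recursion costs a fixed amount of work per step, while the enumeration grows exponentially with the horizon. This is why a closed-form moment recursion is useful for finite-time analysis. In the vector case the squared terms become outer products and Q_i becomes a matrix, which is again assumed here rather than taken from the paper.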

Subjects:	Machine Learning (cs.LG); Systems and Control (eess.SY); Optimization and Control (math.OC)
Cite as:	arXiv:2204.09801 [cs.LG]
	(or arXiv:2204.09801v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2204.09801

Submission history

From: Xingang Guo
[v1] Wed, 20 Apr 2022 22:02:15 UTC (72 KB)
