Finite Sample Analysis of Distributional TD Learning with Linear Function Approximation

Peng, Yang; Jin, Kaicheng; Zhang, Liangyu; Zhang, Zhihua

arXiv:2502.14172 (stat)

[Submitted on 20 Feb 2025]

Abstract: In this paper, we investigate the finite-sample statistical rates of distributional temporal difference (TD) learning with linear function approximation. The aim of distributional TD learning is to estimate the return distribution of a discounted Markov decision process for a given policy $\pi$. Prior work on the statistical analysis of distributional TD learning mainly focuses on the tabular case. In contrast, we first consider the linear function approximation setting and derive sharp finite-sample rates. Our theoretical results demonstrate that the sample complexity of linear distributional TD learning matches that of classic linear TD learning. This implies that, with linear function approximation, learning the full distribution of the return from streaming data is no more difficult than learning its expectation (i.e., the value function). To derive tight sample complexity bounds, we conduct a fine-grained analysis of the linear-categorical Bellman equation and employ exponential stability arguments for products of random matrices. Our findings provide new insights into the statistical efficiency of distributional reinforcement learning algorithms.
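To make the setting concrete, here is a minimal sketch of one categorical distributional TD step with linear features. It is not the algorithm analyzed in the paper: the evenly spaced atom grid, the linear parameterization of atom weights, the function names (`categorical_target`, `predict`, `linear_categorical_td_step`), and the plain semi-gradient update are all assumptions chosen for illustration.

```python
from typing import List


def categorical_target(next_pmf: List[float], atoms: List[float],
                       reward: float, gamma: float) -> List[float]:
    """Project the distribution of reward + gamma * Z' onto the fixed atoms.

    Assumes `atoms` is an evenly spaced, increasing grid (hypothetical setup).
    """
    k = len(atoms)
    v_min, v_max = atoms[0], atoms[-1]
    delta = (v_max - v_min) / (k - 1)
    target = [0.0] * k
    for p, z in zip(next_pmf, atoms):
        # Shift and scale the atom, then clip it to the support.
        tz = min(max(reward + gamma * z, v_min), v_max)
        b = (tz - v_min) / delta          # fractional grid position
        lo = int(b)                       # floor, since b >= 0
        hi = min(lo + 1, k - 1)
        w_hi = b - lo
        # Split the mass between the two neighbouring atoms.
        target[lo] += p * (1.0 - w_hi)
        target[hi] += p * w_hi
    return target


def predict(theta: List[List[float]], phi: List[float]) -> List[float]:
    """Atom weights at a state: theta^T phi, with theta of shape (d, K)."""
    k = len(theta[0])
    return [sum(phi[i] * theta[i][j] for i in range(len(phi))) for j in range(k)]


def linear_categorical_td_step(theta: List[List[float]], phi_s: List[float],
                               phi_next: List[float], reward: float,
                               gamma: float, atoms: List[float],
                               step_size: float) -> List[List[float]]:
    """One semi-gradient step moving the prediction at s toward the projected target."""
    target = categorical_target(predict(theta, phi_next), atoms, reward, gamma)
    current = predict(theta, phi_s)
    for i, f in enumerate(phi_s):
        for j in range(len(atoms)):
            theta[i][j] += step_size * f * (target[j] - current[j])
    return theta


# Two states with one-hot features and three atoms (illustrative values).
atoms = [0.0, 1.0, 2.0]
theta = [[1.0, 0.0, 0.0], [0.5, 0.0, 0.5]]
theta = linear_categorical_td_step(theta, [1.0, 0.0], [0.0, 1.0],
                                   reward=0.25, gamma=0.5,
                                   atoms=atoms, step_size=0.5)
```

With one-hot features the update reduces to the tabular categorical TD rule, which mixes the current estimate with the projected target. With general features the predicted atom weights need not form a valid probability vector, which is one reason a careful analysis of the linear-categorical setting is needed.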

Comments: 57 pages
Subjects: Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as: arXiv:2502.14172 [stat.ML] (or arXiv:2502.14172v1 [stat.ML] for this version)
https://doi.org/10.48550/arXiv.2502.14172

Submission history

From: Yang Peng
[v1] Thu, 20 Feb 2025 00:53:22 UTC (48 KB)
