Inference on Optimal Dynamic Policies via Softmax Approximation

Chen, Qizhao; Austern, Morgane; Syrgkanis, Vasilis

Economics > Econometrics

arXiv:2303.04416v3 (econ)

[Submitted on 8 Mar 2023 (v1), last revised 13 Dec 2023 (this version, v3)]

Title:Inference on Optimal Dynamic Policies via Softmax Approximation

Authors:Qizhao Chen, Morgane Austern, Vasilis Syrgkanis

View PDF

Abstract:Estimating optimal dynamic policies from offline data is a fundamental problem in dynamic decision making. In the context of causal inference, the problem is known as estimating the optimal dynamic treatment regime. Even though there exists a plethora of methods for estimation, constructing confidence intervals for the value of the optimal regime and structural parameters associated with it is inherently harder, as it involves non-linear and non-differentiable functionals of unknown quantities that need to be estimated. Prior work resorted to sub-sample approaches that can deteriorate the quality of the estimate. We show that a simple soft-max approximation to the optimal treatment regime, for an appropriately fast growing temperature parameter, can achieve valid inference on the truly optimal regime. We illustrate our result for a two-period optimal dynamic regime, though our approach should directly extend to the finite horizon case. Our work combines techniques from semi-parametric inference and $g$-estimation, together with an appropriate triangular array central limit theorem, as well as a novel analysis of the asymptotic influence and asymptotic bias of softmax approximations.

Subjects:	Econometrics (econ.EM); Machine Learning (cs.LG); Statistics Theory (math.ST); Methodology (stat.ME)
Cite as:	arXiv:2303.04416 [econ.EM]
	(or arXiv:2303.04416v3 [econ.EM] for this version)
	https://doi.org/10.48550/arXiv.2303.04416

Submission history

From: Qizhao Chen [view email]
[v1] Wed, 8 Mar 2023 07:42:47 UTC (762 KB)
[v2] Fri, 7 Apr 2023 20:08:09 UTC (764 KB)
[v3] Wed, 13 Dec 2023 23:26:48 UTC (796 KB)

Economics > Econometrics

Title:Inference on Optimal Dynamic Policies via Softmax Approximation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Economics > Econometrics

Title:Inference on Optimal Dynamic Policies via Softmax Approximation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators