Inference on Optimal Dynamic Policies via Softmax Approximation

Qizhao Chen; Morgane Austern; Vasilis Syrgkanis

Inference on Optimal Dynamic Policies via Softmax Approximation

Fiche du document

Auteurs

Date

8 mars 2023

Discipline

Economies et finances

Type de document

Textes imprimés

Périmètre

Publications

Identifiant

2303.04416

Source

arXiv - économie

Collection

arXiv

Organisation

Cornell University

Mots-clés Und

Economics - Econometrics Computer Science - Machine Learning Mathematics - Statistics Theory Statistics - Methodology

Sujets proches En

Induction, Ampliative Ampliative induction Inference (Logic)

Citer ce document

Qizhao Chen et al., « Inference on Optimal Dynamic Policies via Softmax Approximation », arXiv - économie

Partage / Export

Résumé 0

Estimating optimal dynamic policies from offline data is a fundamental problem in dynamic decision making. In the context of causal inference, the problem is known as estimating the optimal dynamic treatment regime. Even though there exists a plethora of methods for estimation, constructing confidence intervals for the value of the optimal regime and structural parameters associated with it is inherently harder, as it involves non-linear and non-differentiable functionals of unknown quantities that need to be estimated. Prior work resorted to sub-sample approaches that can deteriorate the quality of the estimate. We show that a simple soft-max approximation to the optimal treatment regime, for an appropriately fast growing temperature parameter, can achieve valid inference on the truly optimal regime. We illustrate our result for a two-period optimal dynamic regime, though our approach should directly extend to the finite horizon case. Our work combines techniques from semi-parametric inference and $g$-estimation, together with an appropriate triangular array central limit theorem, as well as a novel analysis of the asymptotic influence and asymptotic bias of softmax approximations.

Inference on Optimal Dynamic Policies via Softmax Approximation

Fiche du document

Mots-clés Und

Sujets proches En

Citer ce document

Partage / Export

Résumé 0

Par les mêmes auteurs

Sur les mêmes sujets

Sur les mêmes disciplines

Exporter en