Model Selection for Multivalued-Treatment Policy Learning in Observational Studies
Taylor & Francis Group | Updated: 2025-02-03 | Indexed: 2026-04-16
Download link:
https://tandf.figshare.com/articles/dataset/Model_Selection_for_Multivalued-Treatment_Policy_Learning_in_Observational_Studies_/28057259
Description:
This study investigates the policy learning problem in observational studies, where the treatment variable can be multivalued and the propensity scores are unknown. We approximate the optimal policy in a global policy class with infinite complexity (VC/Natarajan) dimension, using a sequence of sieve policy classes with finite complexity dimension. The optimal policy within each sieve class is estimated by maximizing the empirical welfare, which is constructed from the doubly robust moment condition with cross-fitting. To select a suitable sieve space, we maximize the penalized empirical welfare, with the penalty determined by either the Rademacher complexity or a holdout method. We establish oracle inequalities that demonstrate the bias-variance tradeoff achieved by the data-driven policy estimator. We also investigate two specific sieve selections: (a) a monotone single index model and (b) a systematic discretization method, which uses conventional sieve results for smooth functions such as linear sieves and deep neural networks. In the empirical study, we apply our method to examine the policy of assigning individuals to job training of different lengths.
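The pipeline described above (doubly robust scores, empirical welfare maximization within a sieve class, then data-driven selection of the class) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and is not taken from the dataset or the paper's code: the function names are hypothetical, the nuisance estimates `mu_hat` and `e_hat` are assumed to be supplied already cross-fitted, the sieve classes are simple piecewise-constant policies on a scalar covariate in [0, 1], and the selection step is a simplified holdout comparison rather than the paper's exact penalty.

```python
import numpy as np

def dr_scores(y, d, mu_hat, e_hat):
    """Doubly robust scores gamma[i, k] for unit i and treatment arm k.

    mu_hat[i, k]: estimate of E[Y | X_i, D = k] (assumed cross-fitted)
    e_hat[i, k]:  estimate of P(D = k | X_i)    (assumed cross-fitted)
    """
    n_arms = mu_hat.shape[1]
    indicator = (d[:, None] == np.arange(n_arms)[None, :]).astype(float)
    return mu_hat + indicator / e_hat * (y[:, None] - mu_hat)

def empirical_welfare(gamma, assignment):
    """Average doubly robust score under the given treatment assignment."""
    return float(gamma[np.arange(len(assignment)), assignment].mean())

def cell_index(x, n_cells):
    """Map x in [0, 1] to one of n_cells equal-width cells."""
    return np.minimum((x * n_cells).astype(int), n_cells - 1)

def fit_piecewise_policy(x, gamma, n_cells):
    """Welfare-maximizing piecewise-constant policy (illustrative sieve class).

    Welfare decomposes across cells, so the maximizer picks, in each cell,
    the arm with the largest summed score.
    """
    cells = cell_index(x, n_cells)
    best_arm = np.zeros(n_cells, dtype=int)  # empty cells default to arm 0
    for c in range(n_cells):
        mask = cells == c
        if mask.any():
            best_arm[c] = int(gamma[mask].sum(axis=0).argmax())
    return best_arm

def apply_policy(x, best_arm):
    """Assign each unit the arm chosen for its cell."""
    return best_arm[cell_index(x, len(best_arm))]

def select_by_holdout(x, gamma, sieve_sizes, holdout_frac=0.3, seed=0):
    """Fit each sieve class on a training split; keep the class whose
    fitted policy has the highest welfare on the holdout split."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(x))
    n_hold = int(len(x) * holdout_frac)
    hold, train = idx[:n_hold], idx[n_hold:]
    welfare = {}
    for m in sieve_sizes:
        policy = fit_piecewise_policy(x[train], gamma[train], m)
        welfare[m] = empirical_welfare(gamma[hold], apply_policy(x[hold], policy))
    best = max(welfare, key=welfare.get)
    return best, welfare
```

In this sketch, larger values in `sieve_sizes` give richer policy classes (lower approximation bias, higher estimation variance), and the holdout comparison is one simple way to trade the two off.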
Provided by:
Xie, Haitian; Xi, Jin; Fang, Yue
Created:
2025-02-03



