TSEC: a framework for online experimentation under experimental constraints
收藏DataCite Commons2023-01-11 更新2024-08-18 收录
下载链接:
https://tandf.figshare.com/articles/dataset/TSEC_a_framework_for_online_experimentation_under_experimental_constraints/21131405/1
下载链接
链接失效反馈官方服务:
资源简介:
Thompson sampling is a popular algorithm for tackling multi-armed bandit problems, and has been applied in a wide range of applications, from website design to portfolio optimization. In such applications, however, the number of choices (or arms) <i>N</i> can be large, and the data needed to make adaptive decisions require expensive experimentation. One is then faced with the constraint of experimenting on only a small subset of K≪N arms within each time period, which poses a problem for traditional Thompson sampling. We propose a new Thompson Sampling under Experimental Constraints (TSEC) method, which addresses this so-called “arm budget constraint”. TSEC makes use of a Bayesian interaction model with effect hierarchy priors, to model correlations between rewards on different arms. This fitted model is then integrated within Thompson sampling, to jointly identify a good subset of arms for experimentation and to allocate resources over these arms. We demonstrate the effectiveness of TSEC in two applications with arm budget constraints. The first is a simulated website optimization study, where TSEC shows considerable improvements over industry benchmarks. The second is a portfolio optimization application on industry-based exchange-traded funds, where TSEC provides more consistent and greater wealth accumulation over standard investment strategies.
提供机构:
Taylor & Francis
创建时间:
2022-09-16



