Data from: Risk-aware multi-armed bandit problem with application to portfolio selection
收藏DataCite Commons2025-06-01 更新2025-06-15 收录
下载链接:
https://datadryad.org/dataset/doi:10.5061/dryad.h628h
下载链接
链接失效反馈官方服务:
资源简介:
Sequential portfolio selection has attracted increasing interests in the
machine learning and quantitative finance communities in recent years. As
a mathematical framework for reinforcement learning policies, the
stochastic multi-armed bandit problem addresses the primary difficulty in
sequential decision making under uncertainty, namely the exploration
versus exploitation dilemma, and therefore provides a natural connection
to portfolio selection. In this paper, we incorporate risk-awareness into
the classic multi-armed bandit setting and introduce an algorithm to
construct portfolio. Through filtering assets based on the topological
structure of financial market and combining the optimal multi-armed bandit
policy with the minimization of a coherent risk measure, we achieve a
balance between risk and return.
提供机构:
Dryad
创建时间:
2017-10-17



