Decomposition methods for solving Markov decision processes with multiple models of the parameters
收藏Figshare2021-01-19 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Decomposition_Methods_for_Solving_Markov_Decision_Processes_with_Multiple_Models_of_the_Parameters/13607779
下载链接
链接失效反馈官方服务:
资源简介:
We consider the problem of decision-making in Markov decision processes (MDPs) when the reward or transition probability parameters are not known with certainty. We study an approach in which the decision maker considers multiple models of the parameters for an MDP and wishes to find a policy that optimizes an objective function that considers the performance with respect to each model, such as maximizing the expected performance or maximizing worst-case performance. Existing solution methods rely on mixed-integer program (MIP) formulations, but have previously been limited to small instances, due to the computational complexity. In this article, we present branch-and-cut and policy-based branch-and-bound (PB-B&B) solution methods that leverage the decomposable structure of the problem and allow for the solution of MDPs that consider many models of the parameters. Numerical experiments show that a customized implementation of PB-B&B significantly outperforms the MIP-based solution methods and that the variance among model parameters can be an important factor in the value of solving these problems.
创建时间:
2021-01-19



