Stochastic MDP
Source: DataCite Commons | Updated: 2024-12-16 | Indexed: 2025-04-16
Download link:
https://service.tib.eu/ldmservice/dataset/fd61e9bd-3e79-4384-a29e-5a29ddea5d43
Description:
The dataset used in this paper is a stochastic Markov decision process (MDP) with |S| = 4 states and |A| = 4 actions. One state is designated as the terminal state, and one of the remaining three states is designated as the starting state. The transition probabilities and the reward function are randomly generated.
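For illustration only, an MDP of this shape could be generated roughly as follows. This is a minimal sketch, not the generator used for the dataset: the function name `make_random_mdp`, the seed, the uniform sampling of transition weights, the reward range [-1, 1], the choice of rewards indexed by (state, action), and the treatment of the terminal state as absorbing with zero reward are all assumptions that the description does not specify.

```python
import random


def make_random_mdp(n_states=4, n_actions=4, seed=0):
    """Build a small random tabular MDP (hypothetical helper).

    Returns (P, R, start, terminal) where P[s][a] is a list of
    next-state probabilities and R[s][a] is a scalar reward.
    """
    rng = random.Random(seed)

    # Pick one terminal state, then a start state among the rest.
    terminal = rng.randrange(n_states)
    start = rng.choice([s for s in range(n_states) if s != terminal])

    P, R = [], []
    for s in range(n_states):
        p_rows, r_row = [], []
        for _ in range(n_actions):
            if s == terminal:
                # Assumption: the terminal state is absorbing with zero reward.
                probs = [1.0 if t == s else 0.0 for t in range(n_states)]
                reward = 0.0
            else:
                # Assumption: uniform random weights, normalised to sum to 1.
                weights = [rng.random() for _ in range(n_states)]
                total = sum(weights)
                probs = [w / total for w in weights]
                # Assumption: rewards drawn uniformly from [-1, 1].
                reward = rng.uniform(-1.0, 1.0)
            p_rows.append(probs)
            r_row.append(reward)
        P.append(p_rows)
        R.append(r_row)
    return P, R, start, terminal


P, R, start, terminal = make_random_mdp()
print("start:", start, "terminal:", terminal)
```

Fixing the seed makes the sketch reproducible. The actual dataset should be obtained from the download link above, since its sampling distributions may differ.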
Provider:
TIB
Created:
2024-12-16



