GridWorld MDP

Indexed from arXiv on 2025-09-30

Download link:

https://github.com/ddv-lab/robust-cf-inference-in-MDPs

Description:

This dataset is a 4x4 grid-world environment in which an agent starts in the top-left corner, must avoid a hazardous terminal state located at the center of the grid, and must reach the goal state in the bottom-right corner. The agent can move up, down, left, or right, but each move has some probability of resulting in an unintended movement. The environment comes in two variants that differ in this unintended-movement probability: 0.9 and 0.4, respectively. The task objective is to maximize reward while avoiding penalties.
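The dynamics described above can be illustrated with a minimal sketch. It is not the code from the linked repository, and several details are assumptions made only for illustration: the hazard cell is placed at (1, 1) because a 4x4 grid has no single center cell, an unintended movement is taken to be a uniformly random choice among the other three directions, moves off the grid leave the agent in place, and the reward values (+1 at the goal, -1 at the hazard, 0 elsewhere) are placeholders. The names `transition_probs`, `step`, and `slip_prob` are hypothetical.

```python
import random

N = 4                      # grid is N x N
START = (0, 0)             # top-left corner
GOAL = (3, 3)              # bottom-right corner
HAZARD = (1, 1)            # assumption: one of the four central cells
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def transition_probs(state, action, slip_prob):
    """Return {next_state: probability} for taking `action` in `state`.

    Assumption: with probability `slip_prob` the agent moves in one of the
    other three directions, chosen uniformly; off-grid moves keep it in place.
    """
    if state in (GOAL, HAZARD):
        return {state: 1.0}  # terminal states are absorbing
    probs = {}
    for name, (dr, dc) in MOVES.items():
        p = 1.0 - slip_prob if name == action else slip_prob / 3
        r, c = state[0] + dr, state[1] + dc
        nxt = (r, c) if 0 <= r < N and 0 <= c < N else state
        probs[nxt] = probs.get(nxt, 0.0) + p
    return probs


def step(state, action, slip_prob, rng=random):
    """Sample one transition; returns (next_state, reward, done)."""
    probs = transition_probs(state, action, slip_prob)
    states = list(probs)
    nxt = rng.choices(states, weights=[probs[s] for s in states])[0]
    # Placeholder rewards, not taken from the source.
    reward = 1.0 if nxt == GOAL else -1.0 if nxt == HAZARD else 0.0
    return nxt, reward, nxt in (GOAL, HAZARD)
```

Under these assumptions, the two variants correspond to calling `step` with `slip_prob=0.9` or `slip_prob=0.4`; an episode starts at `START` and should stop as soon as `done` is true.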
