Toy Example Dataset

Source: DataCite Commons. Updated 2026-01-07; indexed 2026-05-05.

Download link:

https://service.tib.eu/ldmservice/dataset/a508864e-f6e3-41dc-96b5-9b2ae173d28b

Description:

The dataset used in the paper is a toy example: a 10x10 grid world with the agent starting at position (0, 0). Obstacles are placed at random, with an obstacle-to-free-position ratio of 0.2. The agent is given a plan π (an action sequence) of 10 movements (up, down, left, right, with the obvious semantics). Each action fails with a Bernoulli probability p_fail, sampled uniformly from [0, 1]. A failed action produces the inverse movement (e.g. a failed up yields down). Before running BV, the agent receives a number of observations about its failure probability.
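The setup described above could be sketched roughly as follows. This is a minimal illustration, not the authors' generator: the axis orientation, the way the 0.2 ratio is turned into an obstacle count, the rule that blocked or out-of-bounds moves leave the agent in place, the form of the observations (Bernoulli draws of failure events), and all function and parameter names are assumptions made here.

```python
import random

SIZE = 10
# Movement deltas as (dx, dy); the axis orientation is an assumption.
MOVES = {"up": (0, 1), "down": (0, -1), "left": (-1, 0), "right": (1, 0)}
INVERSE = {"up": "down", "down": "up", "left": "right", "right": "left"}


def make_obstacles(rng, ratio=0.2):
    """Pick obstacle cells so that obstacles / free cells is roughly `ratio`.

    Assumption: the start cell (0, 0) is always kept free.
    """
    cells = [(x, y) for x in range(SIZE) for y in range(SIZE) if (x, y) != (0, 0)]
    n_obstacles = round(SIZE * SIZE * ratio / (1 + ratio))
    return set(rng.sample(cells, n_obstacles))


def step(pos, action, obstacles, p_fail, rng):
    """Apply one action; a failed action becomes its inverse movement."""
    if rng.random() < p_fail:
        action = INVERSE[action]
    dx, dy = MOVES[action]
    nxt = (pos[0] + dx, pos[1] + dy)
    # Assumption: blocked or out-of-bounds moves leave the agent in place.
    in_bounds = 0 <= nxt[0] < SIZE and 0 <= nxt[1] < SIZE
    if not in_bounds or nxt in obstacles:
        return pos
    return nxt


def run_plan(plan, obstacles, p_fail, rng):
    """Execute the whole plan from (0, 0) and return the final position."""
    pos = (0, 0)
    for action in plan:
        pos = step(pos, action, obstacles, p_fail, rng)
    return pos


def sample_instance(seed=0, n_observations=5):
    """Draw one toy instance: obstacles, a 10-step plan, p_fail, observations."""
    rng = random.Random(seed)
    obstacles = make_obstacles(rng)
    plan = [rng.choice(list(MOVES)) for _ in range(10)]
    p_fail = rng.random()  # uniform draw for the failure probability
    # Assumption: each observation is one Bernoulli(p_fail) failure indicator.
    observations = [rng.random() < p_fail for _ in range(n_observations)]
    return obstacles, plan, p_fail, observations
```

Calling `sample_instance(seed=1)` and passing the resulting obstacles, plan and p_fail to `run_plan` simulates a single noisy execution of the plan. The source does not specify how many observations the agent receives, so `n_observations` is left as a free parameter.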

Provider:

TIB

Created:

2024-12-16
