Toy Example Dataset
Source: DataCite Commons. Updated: 2026-01-07. Indexed: 2026-05-05.
Download link:
https://service.tib.eu/ldmservice/dataset/a508864e-f6e3-41dc-96b5-9b2ae173d28b
Description:
The dataset used in the paper is a toy example consisting of a 10x10 grid world, with the agent at position (0, 0). Obstacles are placed at random, with an obstacle-to-free-position ratio of 0.2. The agent is given a plan π (an action sequence) of 10 movements (up, down, left, right, with the obvious semantics). The agent has a Bernoulli action-failure probability p_fail, sampled uniformly from [0, 1]. A failed action results in the inverse movement (e.g., a failed up moves the agent down). The agent is given a number of observations about its failure probability before running BV.
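The setup described above can be sketched roughly as follows. This is an illustrative reconstruction, not the authors' generator. The function names (`make_obstacles`, `run_plan`), the coordinate convention (up means +y), the reading of the ratio as obstacles divided by free cells, and the rule that a blocked or out-of-bounds move leaves the agent in place are all assumptions that the description does not confirm.

```python
import random

# Assumed coordinate convention: "up" increases y, "right" increases x.
MOVES = {"up": (0, 1), "down": (0, -1), "left": (-1, 0), "right": (1, 0)}
INVERSE = {"up": "down", "down": "up", "left": "right", "right": "left"}


def make_obstacles(size=10, ratio=0.2, rng=random):
    """Place obstacles at random, keeping the agent's cell (0, 0) free."""
    # Assumption: ratio = obstacles / free cells, so obstacles = total * r / (1 + r).
    n_obstacles = round(size * size * ratio / (1 + ratio))
    cells = [(x, y) for x in range(size) for y in range(size) if (x, y) != (0, 0)]
    return set(rng.sample(cells, n_obstacles))


def run_plan(plan, obstacles, p_fail, size=10, rng=random):
    """Execute a plan from (0, 0); each action fails with probability p_fail."""
    pos = (0, 0)
    for action in plan:
        if rng.random() < p_fail:
            action = INVERSE[action]  # a failed action becomes its inverse
        dx, dy = MOVES[action]
        nxt = (pos[0] + dx, pos[1] + dy)
        in_bounds = 0 <= nxt[0] < size and 0 <= nxt[1] < size
        # Assumption: moves into walls or obstacles leave the agent in place.
        if in_bounds and nxt not in obstacles:
            pos = nxt
    return pos


rng = random.Random(0)  # fixed seed so the sketch is reproducible
obstacles = make_obstacles(rng=rng)
p_fail = rng.uniform(0.0, 1.0)  # failure probability sampled from [0, 1]
plan = [rng.choice(list(MOVES)) for _ in range(10)]  # a random 10-step plan
print(run_plan(plan, obstacles, p_fail, rng=rng))
```

Under the assumed ratio interpretation, a 10x10 grid gets 17 obstacles and 83 free cells. If the ratio instead means the fraction of all cells, the count would be 20.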
Provider:
TIB
Created:
2024-12-16



