five

Strategy DTMCs from Q-learning Algorithms

收藏
arXiv2025-09-30 收录
下载链接:
https://github.com/rajarshi008/PriTL
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了在随机环境中,特别是OpenAI Gym的冰湖环境里,通过Q-learning算法生成的最优和次优策略的策略DTMCs。该数据集不仅包括正确的LTL任务,也包括错误的LTL任务,每个任务生成了10个正面的最优策略DTMCs和10个负面的最优策略DTMCs,这有助于评估PriTL学习算法。数据集的规模达到了累计状态空间的10^3数量级,其任务是推断策略DTMCs中的简洁LTL规格说明。

This dataset contains strategy DTMCs of optimal and suboptimal policies generated via the Q-learning algorithm in random environments, especially the Frozen Lake environment from OpenAI Gym. It includes both correct and incorrect LTL tasks, with 10 positive optimal policy DTMCs and 10 negative optimal policy DTMCs generated for each task, which facilitates the evaluation of the PriTL learning algorithm. The cumulative state space of the dataset is on the order of 10^3, and its task is to infer concise LTL specifications from the strategy DTMCs.
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作