TempoRL-experiment data

Name: TempoRL-experiment data
Creator: figshare
Published: 2023-05-15 12:38:53
License: No description available

DataCite Commons | Updated 2023-05-15 | Indexed 2024-08-18

Download link: https://figshare.com/articles/dataset/TempoRL-experiment_data/13638023

Description:

Learning curve data, the commands used to generate the data, and the final trained policies for all experiments of the TempoRL paper.

Readme.md -> Describes the folder structure of all three archives
experiments.tar.gz -> Archive containing all experiment results, with the following folders:
    atari -> All results on Atari games
    featurized_results -> Results on LunarLander-v2 and MountainCar-v0
    tabular_results -> Results for the tabular experiments
DDPG results have been added in a separate zip.
tempoCar1.mp4 and tempoCar2.mp4 -> Show learned TempoRL policies in the mountain car environment. The agent blinks bright blue whenever it makes a new decision.
tempoQBert.mp4 -> Shows a TempoRL policy in action on QBert.
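To check a downloaded copy against the folder layout described above, a minimal Python sketch such as the following could be used. It is not part of the dataset. The function names `top_level_counts` and `summarize_archive` are hypothetical. The local file name `experiments.tar.gz` and the assumption that atari, featurized_results, and tabular_results sit at the top level of the archive (rather than under a wrapping folder) are guesses based on the description. Readme.md in the dataset remains the authoritative reference.

```python
import tarfile
from collections import Counter
from pathlib import PurePosixPath


def top_level_counts(member_names):
    """Count archive members under each top-level path component.

    Hypothetical helper for illustration; not part of the dataset.
    """
    counts = Counter()
    for name in member_names:
        # Tar member names use POSIX separators; a leading "./" is collapsed.
        parts = [p for p in PurePosixPath(name).parts if p not in (".", "/")]
        if parts:
            counts[parts[0]] += 1
    return dict(counts)


def summarize_archive(path="experiments.tar.gz"):
    """Open a gzip-compressed tar archive and summarize its top-level layout.

    The default path is an assumption about where the download was saved.
    """
    with tarfile.open(path, "r:gz") as archive:
        return top_level_counts(archive.getnames())
```

Calling `summarize_archive()` on the downloaded file returns a dictionary mapping each top-level entry to the number of members beneath it. If the result shows a single wrapping folder instead of the three folders listed above, the layout assumption does not hold and the paths should be inspected one level deeper.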

Provider: figshare

Created: 2023-05-15
