TempoRL-experiment data
Source: DataCite Commons. Updated 2023-05-15; indexed 2024-08-26.
Download link:
https://figshare.com/articles/dataset/TempoRL-experiment_data/13638023/1
Description:
Learning curve data, the commands used to generate the data, and the final trained policies for all experiments in the TempoRL paper.

Readme.md -> Describes the folder structure of all three archives.
experiments.tar.gz -> Archive containing all experiment results, in the following folders:
    atari -> Contains all results on Atari games.
    featurized_results -> Contains results on LunarLander-v2 and MountainCar-v0.
    tabular_results -> Contains results for the tabular experiments.

DDPG results have been added in a separate zip.

tempoCar1.mp4 and tempoCar2.mp4 show learned TempoRL policies in the mountain car environment. The agent blinks bright blue whenever it makes a new decision.
tempoQBert.mp4 shows a TempoRL policy in action on QBert.
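To check a downloaded copy against the folder layout described above, the top-level entries of the archive can be listed without extracting it. The following is a minimal sketch, not part of the dataset: the helper name top_level_entries is hypothetical, and the local path experiments.tar.gz is an assumption about where the file was saved. The archive may also wrap its folders in a single parent directory, in which case only that directory would be reported.

```python
import tarfile
from pathlib import PurePosixPath


def top_level_entries(archive_path):
    """Return the sorted names of the top-level entries in a tar archive.

    Hypothetical helper for illustration; it only reads the archive index
    and does not extract any files.
    """
    names = set()
    # "r:*" lets tarfile detect the compression (gzip for a .tar.gz file).
    with tarfile.open(archive_path, "r:*") as archive:
        for member in archive.getmembers():
            # Member names use "/" separators; keep only the first component.
            parts = [p for p in PurePosixPath(member.name).parts if p not in (".", "/")]
            if parts:
                names.add(parts[0])
    return sorted(names)


# Example use, assuming the archive was saved to the working directory:
# print(top_level_entries("experiments.tar.gz"))
```

Reading only the index keeps the check cheap, which may matter if the Atari results are large.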
Provider:
figshare
Created:
2023-05-15



