Dataset from ES-C51: Expected Sarsa Based C51 Distributional Reinforcement Learning Algorithm
收藏DataCite Commons2025-10-15 更新2026-05-03 收录
下载链接:
https://federation.figshare.com/articles/dataset/Dataset_from_ES-C51_Expected_Sarsa_Based_C51_Distributional_Reinforcement_Learning_Algorithm/30359872
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains the results of experiments comparing the performance of the standard Q-learning based distributional deep reinforcement learning algorithm QL-C51, and a novel variant which uses Expected-Sarsa temporal difference updates (ES-C51). Each algorithm was executed for 10 separate runs with independent seeds on 22 environments (Acrobot, Cartpole, and the Atari-10 environments with and without stochasticity). Results are reported for each run in terms of the mean episodic reward over the last 10% of learning episodes. Full details are in the corresponding paper.
提供机构:
Federation University Australia
创建时间:
2025-10-15



