five

Dataset from ES-C51: Expected Sarsa Based C51 Distributional Reinforcement Learning Algorithm

收藏
DataCite Commons2025-10-15 更新2026-05-03 收录
下载链接:
https://federation.figshare.com/articles/dataset/Dataset_from_ES-C51_Expected_Sarsa_Based_C51_Distributional_Reinforcement_Learning_Algorithm/30359872
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains the results of experiments comparing the performance of the standard Q-learning based distributional deep reinforcement learning algorithm QL-C51, and a novel variant which uses Expected-Sarsa temporal difference updates (ES-C51). Each algorithm was executed for 10 separate runs with independent seeds on 22 environments (Acrobot, Cartpole, and the Atari-10 environments with and without stochasticity). Results are reported for each run in terms of the mean episodic reward over the last 10% of learning episodes. Full details are in the corresponding paper.
提供机构:
Federation University Australia
创建时间:
2025-10-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作