Dataset from ES-C51: Expected Sarsa Based C51 Distributional Reinforcement Learning Algorithm

Name: Dataset from ES-C51: Expected Sarsa Based C51 Distributional Reinforcement Learning Algorithm
Creator: Federation University Australia
Published: 2025-10-15 00:15:53
License: 暂无描述

DataCite Commons2025-10-15 更新2026-05-03 收录

下载链接：

https://federation.figshare.com/articles/dataset/Dataset_from_ES-C51_Expected_Sarsa_Based_C51_Distributional_Reinforcement_Learning_Algorithm/30359872

下载链接

链接失效反馈

官方服务：

资源简介：

This dataset contains the results of experiments comparing the performance of the standard Q-learning based distributional deep reinforcement learning algorithm QL-C51, and a novel variant which uses Expected-Sarsa temporal difference updates (ES-C51). Each algorithm was executed for 10 separate runs with independent seeds on 22 environments (Acrobot, Cartpole, and the Atari-10 environments with and without stochasticity). Results are reported for each run in terms of the mean episodic reward over the last 10% of learning episodes. Full details are in the corresponding paper.

提供机构：

Federation University Australia

创建时间：

2025-10-15

5,000+

优质数据集

54 个

任务类型

进入经典数据集