five

SEAC Training Data

收藏
arXiv2025-09-30 收录
下载链接:
https://github.com/alpaficia/SEAC_Pytorch_release
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集是由使用软弹性演员-评论家(SEAC)算法进行的实验生成的,比较了该算法与原始软演员-评论家(SAC)算法和近端策略优化(PPO)算法的性能。数据集包含了SEAC算法相较于SAC和PPO算法的平均回报、每项任务的时间成本以及能源消耗等指标。训练规模涉及大约90万至120万步,任务是强化学习,用于控制模拟环境中的智能体运动。

This dataset is generated from experiments conducted with the Soft Elastic Actor-Critic (SEAC) algorithm, which compares the performance of SEAC against the original Soft Actor-Critic (SAC) and Proximal Policy Optimization (PPO) algorithms. The dataset includes metrics such as the average return of SEAC relative to SAC and PPO, the time cost per individual task, and energy consumption. The training scale ranges from approximately 900,000 to 1,200,000 steps, and the tasks are reinforcement learning tasks focused on controlling agent movement in simulated environments.
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作