
WOMD-Normal

Indexed from arXiv on 2025-09-30
Download link:
https://github.com/cmubig/SEAL
Description:
This dataset is designed for closed-loop training of reinforcement learning (RL) agent policies involving ego and adversarial agents. Training follows a curriculum, and agents observe the environment through simulated LiDAR returns. Training runs for one million time steps.
Provider:
MetaDrive