
nortem/marl-gpt-datasets

Source: Hugging Face. Updated: 2026-03-02. Indexed: 2026-03-29.
Download link:
https://hf-mirror.com/datasets/nortem/marl-gpt-datasets
Description:
---
license: mit
pretty_name: MARL-GPT Datasets
tags:
- reinforcement-learning
- multi-agent
- offline-rl
- trajectories
task_categories:
- reinforcement-learning
---

# MARL-GPT Datasets

Offline expert trajectories from **“MARL-GPT: Foundation Model for Multi-Agent Reinforcement Learning”**.

## Environments

This dataset includes trajectories from the three evaluation domains used in MARL-GPT: **SMACv2** (StarCraft multi-agent combat), **Google Research Football (GRF)**, and **POGEMA** (partially observable multi-agent pathfinding on grids).

## Format

Trajectories are stored sequentially (no shuffling). Use the `done` flag to split the stream into per-agent trajectory segments.

Each transition is:

- `(obs, act, rew, done)`

Training-ready data is provided as a flattened dict of tensors:

- `obs`: `(B, n_obs_feat)`
- `act`: `(B,)`
- `rew`: `(B,)`
- `done`: `(B,)` (optional)
- `info_battle_won`: `(B,)` (optional)
- `action_mask`: `(B, n_act)` (optional)

`B` is the number of transitions in the flattened stream.
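
The `done`-based splitting described under Format can be sketched as follows. This is a minimal illustration, not the authors' loader. It assumes that `done` is present, that a truthy `done` marks the last transition of a segment, and that every field can be sliced along its first axis. The helper name `split_by_done`, the `keep_partial` parameter, and the toy values are hypothetical.

```python
from typing import Any, Dict, List, Sequence


def split_by_done(
    batch: Dict[str, Sequence[Any]], keep_partial: bool = True
) -> List[Dict[str, Sequence[Any]]]:
    """Split a flattened transition stream into segments.

    Assumption: a truthy done[i] marks the LAST transition of a segment.
    Every field in `batch` is sliced along its first axis, so plain lists
    and array-like tensors with slicing support both work.
    """
    done = batch["done"]  # raises KeyError if the optional field is absent
    segments = []
    start = 0
    for i in range(len(done)):
        if bool(done[i]):
            # Close the current segment, including transition i.
            segments.append({k: v[start:i + 1] for k, v in batch.items()})
            start = i + 1
    if keep_partial and start < len(done):
        # Trailing transitions with no terminating done flag.
        segments.append({k: v[start:] for k, v in batch.items()})
    return segments


# Toy stream with made-up values (B = 5, n_obs_feat = 2).
toy = {
    "obs": [[0.0, 1.0], [0.1, 0.9], [0.2, 0.8], [0.5, 0.5], [0.6, 0.4]],
    "act": [2, 0, 1, 3, 3],
    "rew": [0.0, 0.0, 1.0, 0.0, 0.5],
    "done": [False, False, True, False, True],
}
segments = split_by_done(toy)
print([len(s["act"]) for s in segments])  # [3, 2]
```

Because the stream is stored without shuffling, a single linear pass over `done` is enough. Set `keep_partial=False` to drop a trailing segment that never reaches a `done` flag.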
Provider:
nortem