five

stevenkhan/pokemon-showdown-battle-sft

收藏
Hugging Face2026-04-24 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/stevenkhan/pokemon-showdown-battle-sft
下载链接
链接失效反馈
官方服务:
资源简介:
这是一个包含500,057个来自高ELO(1400+)竞技Pokémon Showdown比赛的专业战斗决策的数据集,用于监督微调语言模型。数据集将原始的Pokémon Showdown对战日志转换为指令跟随对,其中用户消息描述当前战斗状态,助手消息推荐最优行动并附带战略推理。数据集仅包含获胜玩家的决策,模型从成功中学习。数据来源自milkkarten/pokechamp,经过过滤(ELO≥1400、有赢家的游戏、每场游戏≥3个有意义的回合、仅获胜玩家的视角)。数据集覆盖多个世代和格式,包括OU、RU、Ubers、National Dex、Random Battles等。每个示例都是一个聊天消息列表,包含系统、用户和助手角色。战斗状态跟踪包括活跃Pokémon、状态条件、统计提升、已知招式、能力、物品、Mega进化/Terastallization、替补Pokémon、侧面条件、场地条件、世代和格式等。

A dataset containing 500,057 expert battle decisions from high-ELO (1400+) competitive Pokémon Showdown matches, formatted for supervised fine-tuning of language models. The dataset converts raw Pokémon Showdown replay logs into instruction-following pairs where user messages describe the current battle state and assistant messages recommend the optimal action with strategic reasoning. Only the winning players decisions are included — the model learns from success. The source data is processed from milkkarten/pokechamp, filtered by ELO ≥ 1400, games with a winner, ≥3 meaningful turns per game, and winning players perspective only. The dataset covers multiple generations and formats including OU, RU, Ubers, National Dex, Random Battles, etc. Each example is a list of chat messages with system, user, and assistant roles. Battle state tracking includes active Pokémon, status conditions, stat boosts, known moves, abilities, items, Mega Evolution/Terastallization, bench Pokémon, side conditions, field conditions, generation, and format.
提供机构:
stevenkhan
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作