step_sft
收藏魔搭社区2025-12-05 更新2025-02-15 收录
下载链接:
https://modelscope.cn/datasets/xiaodongguaAIGC/step_sft
下载链接
链接失效反馈官方服务:
资源简介:
Mix:
| 数据集名称 | 是否有step | 可用于PRM训练 | 标签形式 | Title | 备注 |
| ------------- | ---------- | ------------- | ------------ | ------------------------------------------------------------ | -------------------- |
| GSM8K | ✅ | ❌ | 答案 | Training Verifiers to Solve Math Word Problems | |
| MATH | ❌ | ❌ | 答案 | Measuring Mathematical Problem Solving With the MATH Dataset | Non-Step |
| PRM800K | ✅ | ✅ | 正确类别 | Let's Verify Step by Step | prompt deduplication |
| Math-Shepherd | ✅ | ✅ | 正确类别 | Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations | Not used |
| ProcessBench | ✅ | ✅ | 首个错误步骤 | ProcessBench: Identifying Process Errors in Mathematical Reasoning | only label -1 |
数据集汇总:
| 数据集名称 | 是否包含推理步骤 | 可用于PRM训练 | 标签形式 | 论文标题 | 备注 |
| ------------- | -------------- | ------------ | ------------ | ------------------------------------------------------------ | -------------------- |
| GSM8K | ✅ | ❌ | 答案 | 《训练验证器以求解数学应用题》(Training Verifiers to Solve Math Word Problems) | |
| MATH | ❌ | ❌ | 答案 | 《基于MATH数据集评估数学问题求解能力》(Measuring Mathematical Problem Solving With the MATH Dataset) | 非分步式 |
| PRM800K | ✅ | ✅ | 正确类别 | 《逐步验证》(Let's Verify Step by Step) | 提示词去重 |
| Math-Shepherd | ✅ | ✅ | 正确类别 | 《Math-Shepherd:无需人工标注即可逐步验证并强化大语言模型(LLMs)的推理能力》(Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations) | 未使用 |
| ProcessBench | ✅ | ✅ | 首个错误步骤 | 《ProcessBench:识别数学推理中的过程性错误》(ProcessBench: Identifying Process Errors in Mathematical Reasoning) | 仅标注-1类错误 |
提供机构:
maas
创建时间:
2025-02-13



