SYNTHETIC-2-SFT-verified
Source: ModelScope community. Updated 2026-01-02. Indexed 2025-07-12.
Download link:
https://modelscope.cn/datasets/PrimeIntellect/SYNTHETIC-2-SFT-verified
Resource description:
# SYNTHETIC-2
SYNTHETIC-2 is an open reasoning dataset spanning a variety of math, coding, and general reasoning tasks, along with reasoning traces generated collaboratively. The dataset contains both high-quality reasoning traces from DeepSeek-R1-0528, which are well suited for SFT, and multiple reasoning traces from smaller models, which can be used for difficulty estimation.
To read more about our data collection approach, check out our [blog post](https://www.primeintellect.ai/blog/synthetic-2-release).

We release the following final dataset splits on Hugging Face:
- [SYNTHETIC-2](https://huggingface.co/datasets/PrimeIntellect/SYNTHETIC-2): The full SYNTHETIC-2 dataset consisting of all prompts and completions along with rewards
- [SYNTHETIC-2-SFT-verified](https://huggingface.co/datasets/PrimeIntellect/SYNTHETIC-2-SFT-verified): The SFT split of SYNTHETIC-2 with responses from DeepSeek-R1-0528 verified as correct (a reward of 1 for binary rewards, or a reward above 0.7 for non-binary rewards)
- [SYNTHETIC-2-SFT-unverified](https://huggingface.co/datasets/PrimeIntellect/SYNTHETIC-2-SFT-unverified): The SFT split of SYNTHETIC-2 with all responses, including those not verified as correct
- [SYNTHETIC-2-RL](https://huggingface.co/datasets/PrimeIntellect/SYNTHETIC-2-RL): The RL subset of SYNTHETIC-2 with difficulty annotations from Qwen3-32B, Qwen3-4B and DeepSeek-R1-0528-Qwen3-8B
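
The verification rule stated for the SFT-verified split (a reward of 1 for binary rewards, above 0.7 for non-binary rewards) can be sketched as a small filter. This is only an illustration of the rule as described; the field names `reward` and `binary_reward` and the sample rows are hypothetical and are not taken from the dataset schema.

```python
# Minimal sketch of the "verified as correct" rule from the split description.
# Assumption: each row carries a numeric reward and a flag saying whether the
# reward is binary. These field names are hypothetical, not the real schema.

NON_BINARY_THRESHOLD = 0.7  # "over 0.7 for non-binary rewards"


def is_verified(reward: float, binary: bool) -> bool:
    """Return True if a response counts as verified under the stated rule."""
    if binary:
        # Binary rewards must be exactly 1.
        return reward == 1
    # Non-binary rewards must be strictly greater than the threshold.
    return reward > NON_BINARY_THRESHOLD


def filter_verified(rows: list[dict]) -> list[dict]:
    """Keep only the rows whose reward passes the verification rule."""
    return [r for r in rows if is_verified(r["reward"], r["binary_reward"])]


# Made-up example rows, for illustration only.
rows = [
    {"id": "a", "reward": 1.0, "binary_reward": True},
    {"id": "b", "reward": 0.0, "binary_reward": True},
    {"id": "c", "reward": 0.85, "binary_reward": False},
    {"id": "d", "reward": 0.7, "binary_reward": False},
]

verified_ids = [r["id"] for r in filter_verified(rows)]
print(verified_ids)
```

Row `d` is excluded because the description says "over 0.7", which this sketch reads as a strict inequality.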
Provider:
maas
Created:
2025-07-10



