amd/ReasonLite-Dataset

Source: Hugging Face. Updated 2026-01-22; indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/amd/ReasonLite-Dataset
Description:
The ReasonLite dataset contains 343K math problems collected from Polaris and OpenMathReasoning. With GPT-OSS as the teacher model, 9.1M raw answers were generated in the medium and high reasoning modes. Pseudo-labels were derived by majority voting, and 6.1M samples were retained. The retained data is split into two parts: Short CoT (4.3M) and Long CoT (1.8M).
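
The majority-voting step mentioned in the description can be illustrated with a minimal sketch. The listing does not say how answers were normalized, what agreement threshold was used, or how samples were filtered afterwards. The function names (`majority_vote`, `keep_matching`), the `min_agreement` threshold, and the rule of keeping only responses that agree with the pseudo-label are therefore assumptions made for illustration, not the dataset authors' pipeline.

```python
from collections import Counter


def majority_vote(answers, min_agreement=0.5):
    """Pick a pseudo-label from several teacher answers to one problem.

    Returns (label, agreement). The label is None when no answer is held
    by more than `min_agreement` of the non-empty answers.
    NOTE: the threshold is a hypothetical parameter, not from the source.
    """
    # Drop empty answers and strip surrounding whitespace before counting.
    cleaned = [a.strip() for a in answers if a and a.strip()]
    if not cleaned:
        return None, 0.0
    label, count = Counter(cleaned).most_common(1)[0]
    agreement = count / len(cleaned)
    if agreement <= min_agreement:
        return None, agreement
    return label, agreement


def keep_matching(responses, min_agreement=0.5):
    """Keep the responses whose final answer equals the pseudo-label.

    `responses` is a list of dicts with a "final_answer" key (an assumed
    schema). Returns an empty list when no pseudo-label is found.
    """
    label, _ = majority_vote(
        [r["final_answer"] for r in responses], min_agreement
    )
    if label is None:
        return []
    return [r for r in responses if r["final_answer"].strip() == label]


if __name__ == "__main__":
    responses = [
        {"final_answer": "42", "mode": "medium"},
        {"final_answer": "42", "mode": "high"},
        {"final_answer": "7", "mode": "medium"},
    ]
    print(majority_vote([r["final_answer"] for r in responses]))
    print(len(keep_matching(responses)))
```

Under this reading, a filter of this kind would account for the drop from 9.1M raw answers to 6.1M retained samples, but the listing does not confirm the exact rule.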
Provider:
amd