Yang-Zhou/DAPO-Math-17k-Qwen3-235B-A22B-Thinking-2507-rejection-distill

Name: Yang-Zhou/DAPO-Math-17k-Qwen3-235B-A22B-Thinking-2507-rejection-distill
Creator: Yang-Zhou
Published: 2025-11-04 06:23:12
License: 暂无描述

Hugging Face2025-11-04 更新2025-10-25 收录

下载链接：

https://hf-mirror.com/datasets/Yang-Zhou/DAPO-Math-17k-Qwen3-235B-A22B-Thinking-2507-rejection-distill

下载链接

链接失效反馈

官方服务：

资源简介：

DAPO-Math-17k-Qwen3-235B-A22B-Thinking-2507-rejection-distill是一个高质量的Chain-of-Thought数据集，使用Qwen/Qwen3-235B-A22B-Thinking-2507模型基于DAPO-Math-17k数据集生成，并经过拒绝抽样筛选。适用于SFT蒸馏训练，用于提升模型的数学推理能力。

DAPO-Math-17k-Qwen3-235B-A22B-Thinking-2507-rejection-distill is a high-quality Chain-of-Thought dataset generated using the Qwen/Qwen3-235B-A22B-Thinking-2507 model based on the DAPO-Math-17k dataset with rejection sampling. It is suitable for SFT distillation training to enhance the mathematical reasoning capabilities of models.

提供机构：

Yang-Zhou

5,000+

优质数据集

54 个

任务类型

进入经典数据集