five

zake7749/Qwen3.5-27B-DeepCoder-SFT

收藏
Hugging Face2026-04-08 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/zake7749/Qwen3.5-27B-DeepCoder-SFT
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en license: cc-by-4.0 task_categories: - text-generation tags: - code - competitive-programming - rejection-sampling - sft - reasoning - python pretty_name: Qwen3.5-27B DeepCoder SFT size_categories: - 10K<n<100K --- ## Overview This dataset contains **14,683 verified code solutions** with chain-of-thought reasoning, generated by `Qwen3.5-27B` on competitive programming problems. Each example has been executed against test cases in a sandboxed environment and passes 100% of tests. Key features: - **Reasoning traces** — 77.5% of examples include step-by-step reasoning before the final code solution - **Verified correctness** — every solution passes all test cases (up to 30 per problem) - **Rejection sampled** — 8 candidates generated per problem, only passing solutions retained --- ## Dataset Statistics | Metric | Value | |--------|-------| | Total examples | 14,683 | | Source | DeepCoder (100%) | | Problems covered | ~4,700 | | With reasoning | 11,379 (77.5%) | | Without reasoning (code only) | 3,304 (22.5%) | | Test cases per problem | mean 29.5, median 30 | --- ## Field Descriptions | Field | Type | Description | |-------|------|-------------| | `id` | string | Unique identifier: `{problem_id}_sft_{candidate_index}` | | `problem` | string | Full problem statement in English | | `reasoning` | string | Chain-of-thought reasoning before the solution (empty for 22.5% of examples) | | `answer` | string | Final answer containing the code block | | `response` | string | Complete raw model output (reasoning + answer combined, for backward compatibility) | | `solution` | string | Extracted Python code from the answer | | `test_pass_rate` | float | Always 1.0 (all tests passed) | | `tests_passed` | int | Number of test cases passed | | `tests_total` | int | Total number of test cases executed | | `source` | string | Data source (`"deepcoder"`) | --- ## Usage ```python from datasets import load_dataset ds = load_dataset("zake7749/Qwen3.5-27B-OpenCode-SFT", split="train") # Only examples with reasoning with_reasoning = ds.filter(lambda x: len(x["reasoning"]) > 0) print(f"With reasoning: {len(with_reasoning)}") # Access fields example = ds[0] print(example["reasoning"]) # chain-of-thought print(example["solution"]) # extracted Python code ``` --- ## Related Datasets - [zake7749/Qwen3-Coder-Next-OpenCode-SFT](https://huggingface.co/datasets/zake7749/Qwen3-Coder-Next-OpenCode-SFT) — 49,374 SFT examples from Qwen3-Coder-Next - [zake7749/Qwen3-Coder-Next-OpenCode-Preference](https://huggingface.co/datasets/zake7749/Qwen3-Coder-Next-OpenCode-Preference) — 10,920 preference pairs for DPO/KTO ## Call for Collaboration We are actively looking for **compute sponsors and research collaborators** to scale up this work. Our roadmap includes: - **Scaling data generation** — more problems, more candidates per problem, and broader language coverage - **SFT & RL training** — training and open-sourcing code reasoning models using this dataset and its SFT counterpart - **Expanding the pipeline** — supporting more base models beyond Qwen3-Coder-Next We welcome all forms of collaboration: - **Compute sponsorship** — GPU hours for large-scale generation and training runs - **Joint research** — co-developing models, co-authoring technical reports ## License Source data licenses should be verified before redistribution: - **DeepCoder** (`agentica-org/DeepCoder-Preview-Dataset`): Check HuggingFace dataset card ## Citation ```bibtex @misc{yang_qwen35_opencode_sft_2026, title={Qwen3.5-27B-OpenCode-SFT}, author={Yang, Kai-Chou}, year={2026}, publisher={Hugging Face}, url={https://huggingface.co/datasets/zake7749/Qwen3.5-27B-OpenCode-SFT/} } ```

--- language: - 英语 license: CC BY 4.0 task_categories: - 文本生成 tags: - 代码 - 编程竞赛 - 拒绝采样 - 监督微调(Supervised Fine-Tuning,SFT) - 推理 - Python pretty_name: Qwen3.5-27B DeepCoder 监督微调数据集 size_categories: - 10000 < 样本数 < 100000 --- ## 数据集概览 本数据集包含**14683份经过验证的代码解决方案**,附带思维链(Chain-of-Thought,CoT)推理过程,由`Qwen3.5-27B`针对编程竞赛题目生成。所有样本均在沙箱环境中针对测试用例执行,且100%通过所有测试。 核心特性: - **推理轨迹**:77.5%的样本在最终代码解决方案前包含逐步推理过程 - **正确性验证**:每份解决方案均通过所有测试用例(单题最多包含30个测试用例) - **拒绝采样**:每道题目生成8个候选方案,仅保留通过测试的解决方案 ## 数据集统计信息 | 指标 | 数值 | | ---- | ---- | | 总样本数 | 14683 | | 数据来源 | DeepCoder(100%) | | 覆盖题目数 | 约4700道 | | 带推理过程样本 | 11379份(占比77.5%) | | 无推理过程样本(仅含代码) | 3304份(占比22.5%) | | 单题测试用例数 | 均值29.5,中位数30 | ## 字段说明 | 字段 | 数据类型 | 描述 | | ---- | ---- | ---- | | `id` | 字符串 | 唯一标识符,格式为`{problem_id}_sft_{candidate_index}` | | `problem` | 字符串 | 英文完整题目描述 | | `reasoning` | 字符串 | 解决方案前的思维链推理过程,22.5%的样本此字段为空 | | `answer` | 字符串 | 包含代码块的最终答案 | | `response` | 字符串 | 模型完整原始输出(推理过程与答案的组合,用于向后兼容) | | `solution` | 字符串 | 从答案中提取的Python代码 | | `test_pass_rate` | 浮点数 | 固定为1.0(所有测试均通过) | | `tests_passed` | 整数 | 通过的测试用例数量 | | `tests_total` | 整数 | 执行的测试用例总数 | | `source` | 字符串 | 数据来源,固定为`"deepcoder"` | ## 使用方法 python from datasets import load_dataset ds = load_dataset("zake7749/Qwen3.5-27B-OpenCode-SFT", split="train") # 仅筛选包含推理过程的样本 with_reasoning = ds.filter(lambda x: len(x["reasoning"]) > 0) print(f"包含推理过程的样本数:{len(with_reasoning)}") # 访问字段内容 example = ds[0] print(example["reasoning"]) # 思维链推理内容 print(example["solution"]) # 提取的Python代码 ## 相关数据集 - [zake7749/Qwen3-Coder-Next-OpenCode-SFT](https://huggingface.co/datasets/zake7749/Qwen3-Coder-Next-OpenCode-SFT):来自Qwen3-Coder-Next的49374条监督微调(SFT)样本 - [zake7749/Qwen3-Coder-Next-OpenCode-Preference](https://huggingface.co/datasets/zake7749/Qwen3-Coder-Next-OpenCode-Preference):用于直接偏好优化(Direct Preference Optimization,DPO)与知识增强时序优化(Knowledge-based Temporal Optimization,KTO)的10920条偏好配对样本 ## 合作招募 我们正积极寻求**算力赞助方与研究合作者**,以推进该项目的规模化发展。我们的路线图包含: - **扩大数据生成规模**:覆盖更多题目、增加单题候选方案数量、拓展语言覆盖范围 - **监督微调与强化学习训练**:使用本数据集及其配套监督微调数据集训练并开源代码推理模型 - **扩展流程管线**:支持Qwen3-Coder-Next以外的更多基础模型 我们欢迎各类形式的合作: - **算力赞助**:为大规模生成与训练任务提供GPU算力时长 - **联合研究**:共同开发模型、共同撰写技术报告 ## 许可证 重新分发本数据集前,请验证源数据的许可证: - **DeepCoder**(`agentica-org/DeepCoder-Preview-Dataset`):请查看Hugging Face数据集卡片获取详细信息 ## 引用格式 bibtex @misc{yang_qwen35_opencode_sft_2026, title={Qwen3.5-27B-OpenCode-SFT}, author={Yang, Kai-Chou}, year={2026}, publisher={Hugging Face}, url={https://huggingface.co/datasets/zake7749/Qwen3.5-27B-OpenCode-SFT/} }
提供机构:
zake7749
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作