Jackrong/DeepSeek-V4-Distill-8000x

Name: Jackrong/DeepSeek-V4-Distill-8000x
Creator: Jackrong
Published: 2026-04-24 08:32:56
License: 暂无描述

Hugging Face2026-04-24 更新2026-04-26 收录

下载链接：

https://hf-mirror.com/datasets/Jackrong/DeepSeek-V4-Distill-8000x

下载链接

链接失效反馈

官方服务：

资源简介：

DeepSeek-V4-Distill-8100x是一个用于推理导向蒸馏的监督微调数据集。问题提示来源于Jackrong/GLM-5.1-Reasoning-1M-Cleaned数据集，答案由教师模型DeepSeek-V4-Flash生成。经过清理后，发布的训练集包含7,716个高质量的JSONL示例。数据集格式为JSONL，包含对话式和直接输入/输出字段。主要用途包括推理导向的监督微调、蒸馏实验和格式转换实验。但需要注意，教师生成的响应可能包含事实错误、推理伪影或风格偏见。

DeepSeek-V4-Distill-8100x is a supervised fine-tuning dataset for reasoning-oriented distillation. The question prompts come from Jackrong/GLM-5.1-Reasoning-1M-Cleaned, and the answers were generated by the teacher model DeepSeek-V4-Flash. After the cleaning process, the released train split contains 7,716 high-quality JSONL examples. The dataset format is JSONL, including both conversation-style and direct input/output fields. It is primarily intended for reasoning-oriented supervised fine-tuning, distillation experiments, and format conversion experiments. However, it should be noted that the teacher-generated responses may contain factual errors, reasoning artifacts, or style biases.

提供机构：

Jackrong

5,000+

优质数据集

54 个

任务类型

进入经典数据集