Jackrong/DeepSeek-V4-Distill-8000x
Hugging Face | Updated 2026-04-24 | Indexed 2026-04-26
Download link:
https://hf-mirror.com/datasets/Jackrong/DeepSeek-V4-Distill-8000x
Overview:
DeepSeek-V4-Distill-8000x is a supervised fine-tuning dataset for reasoning-oriented distillation. The prompts come from Jackrong/GLM-5.1-Reasoning-1M-Cleaned, and the answers were generated by the teacher model DeepSeek-V4-Flash. After cleaning, the released train split contains 7,716 high-quality examples in JSONL format, with both conversation-style and direct input/output fields. The dataset is intended primarily for reasoning-oriented supervised fine-tuning, distillation experiments, and format-conversion experiments. Note that teacher-generated responses may contain factual errors, reasoning artifacts, or stylistic biases.
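Since the records are described as JSONL with both conversation-style and direct input/output fields, a reader might normalize either layout into plain (prompt, response) pairs before fine-tuning. The following is a minimal sketch only: the key names `input`, `output`, `conversations`, `role`, and `content` are assumptions made for illustration and are not confirmed by this listing, so check them against the actual files before use.

```python
import json


def to_pair(record):
    """Return (prompt, response) for one record, or None if no layout matches.

    All key names below are hypothetical; the listing does not specify them.
    """
    # Direct layout (assumed keys: "input" and "output").
    if "input" in record and "output" in record:
        return record["input"], record["output"]

    # Conversation layout (assumed: "conversations" is a list of
    # {"role": ..., "content": ...} turns).
    turns = record.get("conversations") or []
    prompt = next((t["content"] for t in turns if t.get("role") == "user"), None)
    response = next(
        (t["content"] for t in turns if t.get("role") == "assistant"), None
    )
    if prompt is not None and response is not None:
        return prompt, response
    return None


def read_pairs(lines):
    """Parse an iterable of JSONL lines into a list of (prompt, response) pairs."""
    pairs = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        pair = to_pair(json.loads(line))
        if pair is not None:
            pairs.append(pair)
    return pairs
```

To apply this to a downloaded file, something like `read_pairs(open("train.jsonl", encoding="utf-8"))` would work, where `train.jsonl` is a placeholder filename. Records matching neither assumed layout are skipped silently, so comparing the number of pairs against the stated 7,716 examples is a quick way to check whether the assumed keys are right.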
Provider:
Jackrong



