five

Jackrong/DeepSeek-V4-Distill-8000x

收藏
Hugging Face2026-04-24 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/Jackrong/DeepSeek-V4-Distill-8000x
下载链接
链接失效反馈
官方服务:
资源简介:
DeepSeek-V4-Distill-8100x是一个用于推理导向蒸馏的监督微调数据集。问题提示来源于Jackrong/GLM-5.1-Reasoning-1M-Cleaned数据集,答案由教师模型DeepSeek-V4-Flash生成。经过清理后,发布的训练集包含7,716个高质量的JSONL示例。数据集格式为JSONL,包含对话式和直接输入/输出字段。主要用途包括推理导向的监督微调、蒸馏实验和格式转换实验。但需要注意,教师生成的响应可能包含事实错误、推理伪影或风格偏见。

DeepSeek-V4-Distill-8100x is a supervised fine-tuning dataset for reasoning-oriented distillation. The question prompts come from Jackrong/GLM-5.1-Reasoning-1M-Cleaned, and the answers were generated by the teacher model DeepSeek-V4-Flash. After the cleaning process, the released train split contains 7,716 high-quality JSONL examples. The dataset format is JSONL, including both conversation-style and direct input/output fields. It is primarily intended for reasoning-oriented supervised fine-tuning, distillation experiments, and format conversion experiments. However, it should be noted that the teacher-generated responses may contain factual errors, reasoning artifacts, or style biases.
提供机构:
Jackrong
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作