AQ-MedAI/Kimi-K2-Instruct-open-perfectblend-regenerate
收藏Hugging Face2025-12-22 更新2026-01-03 收录
下载链接:
https://hf-mirror.com/datasets/AQ-MedAI/Kimi-K2-Instruct-open-perfectblend-regenerate
下载链接
链接失效反馈官方服务:
资源简介:
Kimi-K2-Instruct-open-perfectblend-regenerate是一个综合性的合成数据集,专门为支持Kimi-K2-Instruct-eagle3架构而设计。该数据集基于Open-PerfectBlend的140万样本,使用Kimi-K2-Instruct模型重新生成响应,以确保训练数据与教师模型的概率分布完美对齐。这一蒸馏过程对于有效训练推测解码层(EAGLE3)至关重要。数据集用于增强模型处理多任务场景的能力,包括复杂数学推理、代码生成和聊天对话,通过提供直接从教师模型获得的高一致性训练信号来实现。
Kimi-K2-Instruct-open-perfectblend-regenerate is a comprehensive synthetic dataset specifically curated to support the Kimi-K2-Instruct-eagle3 architecture. Similar to its Ling-Flash counterpart, this dataset takes the 1.4 million samples from the Open-PerfectBlend dataset and regenerates the responses using the Kimi-K2-Instruct model. This distillation process ensures that the training data is perfectly aligned with the teacher models probability distribution, which is critical for the effective training of speculative decoding layers (EAGLE3). The dataset significantly enhances the models ability to handle multi-task scenarios—including complex mathematical reasoning, code generation, and chat dialogue—by providing high-consistency training signals derived directly from the teacher model.
提供机构:
AQ-MedAI



