inclusionAI/Ring-lite-distill-preview-dpo-data
Source: Hugging Face. Updated: 2025-04-15. Indexed: 2025-05-31.
Download link:
https://hf-mirror.com/datasets/inclusionAI/Ring-lite-distill-preview-dpo-data
Resource description:
The Ring-lite-distill-preview dataset consists of two components: Ring-lite-distill-preview-sft-data and Ring-lite-distill-preview-dpo-data. Ring-lite-distill-preview-dpo-data is the DPO (Direct Preference Optimization) subset used to train the Ring-lite-distill-preview model. It contains approximately 4K high-quality English and Chinese samples focused on complex reasoning tasks and instruction following.
Provider:
inclusionAI



