oumi-ai/lmsys_chat_1m_clean_R1
Source: Hugging Face | Updated: 2025-02-25 | Indexed: 2025-04-12
Download link:
https://hf-mirror.com/datasets/oumi-ai/lmsys_chat_1m_clean_R1
Description:
oumi-ai/lmsys_chat_1m_clean_R1 is a text dataset for training conversational language models with DeepSeek-R1-level reasoning. Its prompts are filtered and denoised from the LMSYS dataset, and its responses are generated by DeepSeek-R1. The dataset is released under the Apache 2.0 license, contains English conversational data, and is suitable for supervised fine-tuning of LLMs to create R1-like models.
Provider:
oumi-ai



