science-of-finetuning/lmsys-chat-1m-chat-formatted

Name: science-of-finetuning/lmsys-chat-1m-chat-formatted
Creator: science-of-finetuning
Published: 2025-02-11 05:22:06
License: 暂无描述

Hugging Face2025-02-11 更新2025-02-15 收录

下载链接：

https://hf-mirror.com/datasets/science-of-finetuning/lmsys-chat-1m-chat-formatted

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是一个包含对话信息的集合，其中包括会话ID、所使用的模型类型、会话的具体内容（发言内容和角色）、对话轮次、使用的语言、OpenAI内容审查的详细分类和对应分数、是否被标记为问题内容、是否经过编辑、原始文本内容、文本的基础格式以及Qwen2.5格式的文本。数据集分为训练集和验证集，分别用于模型的训练和验证。

This dataset is a collection of conversational information, including conversation ID, model type, specific content of the conversation (speech content and role), turn of the conversation, language used, detailed classification and corresponding scores of OpenAI content moderation, whether it is marked as problematic content, whether it has been edited, original text content, basic format of the text, and text in Qwen2.5 format. The dataset is divided into training and validation sets for model training and validation respectively.

提供机构：

science-of-finetuning

5,000+

优质数据集

54 个

任务类型

进入经典数据集