argilla-warehouse/smollm-v2-rewriting
收藏Hugging Face2024-10-24 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/argilla-warehouse/smollm-v2-rewriting
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为smollm-v2-rewriting,由distilabel工具生成,主要用于文本重写任务。数据集包含多个特征,如系统提示、指令、内容类型、生成文本等。内容类型包括电子邮件、LinkedIn帖子和推文。数据集分为训练集和测试集,分别包含281075和49602个样本。数据集的目标是通过AI助手将输入文本重写为更简洁的形式,同时保留核心意义。
The dataset named smollm-v2-rewriting is generated using the distilabel tool and is primarily used for text rewriting tasks. The dataset includes multiple features such as system prompts, instructions, types of content, and generated text. The types of content include emails, LinkedIn posts, and tweets. The dataset is divided into training and test sets, containing 281,075 and 49,602 samples respectively. The goal of the dataset is to use an AI assistant to rewrite input text into a more concise form while preserving its core meaning.
提供机构:
argilla-warehouse



