seyeon-shijuan/my-distiset-ed97569e
收藏Hugging Face2025-04-08 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/seyeon-shijuan/my-distiset-ed97569e
下载链接
链接失效反馈官方服务:
资源简介:
这是一个使用distilabel工具生成的数据集,包含了用于文本生成、文本到文本生成和问答任务的训练数据。数据集由prompt和completion两个字段组成,其中prompt为字符串类型,completion为null。数据集包含一个名为default的配置,可通过提供的pipeline.yaml文件再现数据生成流程。数据集标签包括合成数据、distilabel、rlaif和数据构建。
This dataset has been generated using distilabel and contains training data for text generation, text-to-text generation, and question-answering tasks. The dataset consists of two fields, prompt which is a string type, and completion which is null. It includes a configuration named default and can reproduce the data generation process with the provided pipeline.yaml file. The dataset is tagged with synthetic data, distilabel, rlaif, and datacraft.
提供机构:
seyeon-shijuan



