distilabel-internal-testing/fine-preferences-magpie-generated-system-prompt-v3
收藏Hugging Face2024-07-18 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/distilabel-internal-testing/fine-preferences-magpie-generated-system-prompt-v3
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是通过distilabel工具生成的,包含了一个`pipeline.yaml`文件,可以用于复现生成该数据集的流程。数据集的样本结构包括多个字段,如`text`、`id`、`dump`、`url`、`file_path`、`language`、`language_score`、`token_count`、`score`、`int_score`、`generated_system_prompt`、`distilabel_metadata`、`gen_conv_model_name`、`system_prompt`、`conversation`、`generations`和`generations_model_names`等。数据集的内容涉及天文学领域,特别是关于伽马射线暴(GRB)及其对星际介质(ISM)的影响的对话。数据集的配置为`default`,包含100个样本,总大小为1697480字节。
This dataset contains a `pipeline.yaml` file that can be used to reproduce the pipeline that generated it using the `distilabel` CLI. The dataset includes various features such as text, id, dump, url, file_path, language, language_score, token_count, score, int_score, generated_system_prompt, distilabel_metadata, gen_conv_model_name, system_prompt, conversation, generations, and generations_model_names. The dataset has a single split named train with 100 examples. The dataset is tagged with synthetic, distilabel, and rlaif.
提供机构:
distilabel-internal-testing



