five

distilabel-internal-testing/test-distiset-extra-info

收藏
Hugging Face2024-03-26 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/distilabel-internal-testing/test-distiset-extra-info
下载链接
链接失效反馈
官方服务:
资源简介:
--- size_categories: n<1K config_names: - load_data tags: - synthetic - distilabel - rlaif --- <p align="left"> <a href="https://github.com/argilla-io/distilabel"> <img src="https://raw.githubusercontent.com/argilla-io/distilabel/main/docs/assets/distilabel-badge-light.png" alt="Built with Distilabel" width="200" height="32"/> </a> </p> # Dataset Card for test-distiset-extra-info This dataset has been created with [Distilabel](https://distilabel.argilla.io/). ## Dataset Summary This dataset contains a `pipeline.yaml` which can be used to reproduce the pipeline that generated it in distilabel using the `distilabel` CLI: ```console ❯ distilabel pipeline run --config "https://huggingface.co/datasets/distilabel-internal-testing/test-distiset-extra-info/raw/main/pipeline.yaml" ``` or explore the configuration: ```console ❯ distilabel pipeline info --config "https://huggingface.co/datasets/distilabel-internal-testing/test-distiset-extra-info/raw/main/pipeline.yaml" ``` ## Dataset structure The examples have the following structure per configuration: <details><summary> Configuration: load_data </summary><hr> ```json { "completion": "The Beatles are widely credited with starting the British Invasion, a wave of rock and roll bands from the United Kingdom that became popular in America in the 1960s. If the Beatles had never formed, this musical movement may have never happened, and the world may have been exposed to a different kind of music. So, without the Beatles\u2019 fame and success, other bands wouldn\u2019t have been able to break into the American music scene and create a wider range of sounds. We could have ended up with a less interesting mix of songs playing on the radio.", "instruction": "What if the Beatles had never formed as a band?" } ``` This subset can be loaded as: ```python from datasets import load_dataset ds = load_dataset("distilabel-internal-testing/test-distiset-extra-info", "load_data") ``` </details>
提供机构:
distilabel-internal-testing
原始信息汇总

数据集概述

数据集名称

  • 名称: test-distiset-extra-info

数据集创建工具

  • 创建工具: Distilabel

数据集摘要

  • 包含一个pipeline.yaml文件,用于在Distilabel中重现生成该数据集的流程。

数据集结构

  • 配置名称: load_data
  • 数据结构:
    • 示例内容: json { "completion": "The Beatles are widely credited with starting the British Invasion, a wave of rock and roll bands from the United Kingdom that became popular in America in the 1960s. If the Beatles had never formed, this musical movement may have never happened, and the world may have been exposed to a different kind of music. So, without the Beatlesu2019 fame and success, other bands wouldnu2019t have been able to break into the American music scene and create a wider range of sounds. We could have ended up with a less interesting mix of songs playing on the radio.", "instruction": "What if the Beatles had never formed as a band?" }

数据集加载

  • 加载方式: python from datasets import load_dataset

    ds = load_dataset("distilabel-internal-testing/test-distiset-extra-info", "load_data")

数据集标签

  • 标签:
    • synthetic
    • distilabel
    • rlaif

数据集大小

  • 大小: n<1K
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作