distilabel-internal-testing/test-distiset-extra-info
收藏Hugging Face2024-03-26 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/distilabel-internal-testing/test-distiset-extra-info
下载链接
链接失效反馈官方服务:
资源简介:
---
size_categories: n<1K
config_names:
- load_data
tags:
- synthetic
- distilabel
- rlaif
---
<p align="left">
<a href="https://github.com/argilla-io/distilabel">
<img src="https://raw.githubusercontent.com/argilla-io/distilabel/main/docs/assets/distilabel-badge-light.png" alt="Built with Distilabel" width="200" height="32"/>
</a>
</p>
# Dataset Card for test-distiset-extra-info
This dataset has been created with [Distilabel](https://distilabel.argilla.io/).
## Dataset Summary
This dataset contains a `pipeline.yaml` which can be used to reproduce the pipeline that generated it in distilabel using the `distilabel` CLI:
```console
❯ distilabel pipeline run --config "https://huggingface.co/datasets/distilabel-internal-testing/test-distiset-extra-info/raw/main/pipeline.yaml"
```
or explore the configuration:
```console
❯ distilabel pipeline info --config "https://huggingface.co/datasets/distilabel-internal-testing/test-distiset-extra-info/raw/main/pipeline.yaml"
```
## Dataset structure
The examples have the following structure per configuration:
<details><summary> Configuration: load_data </summary><hr>
```json
{
"completion": "The Beatles are widely credited with starting the British Invasion, a wave of rock and roll bands from the United Kingdom that became popular in America in the 1960s. If the Beatles had never formed, this musical movement may have never happened, and the world may have been exposed to a different kind of music. So, without the Beatles\u2019 fame and success, other bands wouldn\u2019t have been able to break into the American music scene and create a wider range of sounds. We could have ended up with a less interesting mix of songs playing on the radio.",
"instruction": "What if the Beatles had never formed as a band?"
}
```
This subset can be loaded as:
```python
from datasets import load_dataset
ds = load_dataset("distilabel-internal-testing/test-distiset-extra-info", "load_data")
```
</details>
提供机构:
distilabel-internal-testing
原始信息汇总
数据集概述
数据集名称
- 名称: test-distiset-extra-info
数据集创建工具
- 创建工具: Distilabel
数据集摘要
- 包含一个
pipeline.yaml文件,用于在Distilabel中重现生成该数据集的流程。
数据集结构
- 配置名称: load_data
- 数据结构:
- 示例内容: json { "completion": "The Beatles are widely credited with starting the British Invasion, a wave of rock and roll bands from the United Kingdom that became popular in America in the 1960s. If the Beatles had never formed, this musical movement may have never happened, and the world may have been exposed to a different kind of music. So, without the Beatlesu2019 fame and success, other bands wouldnu2019t have been able to break into the American music scene and create a wider range of sounds. We could have ended up with a less interesting mix of songs playing on the radio.", "instruction": "What if the Beatles had never formed as a band?" }
数据集加载
-
加载方式: python from datasets import load_dataset
ds = load_dataset("distilabel-internal-testing/test-distiset-extra-info", "load_data")
数据集标签
- 标签:
- synthetic
- distilabel
- rlaif
数据集大小
- 大小: n<1K



