distilabel-internal-testing/dpo-mix-4k-criticurus-temperature0.7-v0.0

Name: distilabel-internal-testing/dpo-mix-4k-criticurus-temperature0.7-v0.0
Creator: distilabel-internal-testing
Published: 2024-04-18 12:54:42
License: 暂无描述

Hugging Face2024-04-18 更新2025-04-12 收录

下载链接：

https://hf-mirror.com/datasets/distilabel-internal-testing/dpo-mix-4k-criticurus-temperature0.7-v0.0

下载链接

链接失效反馈

官方服务：

资源简介：

--- size_categories: n<1K dataset_info: features: - name: instruction dtype: string - name: response dtype: string - name: rating dtype: float64 - name: dataset_name dtype: string - name: model_name dtype: string - name: score dtype: string - name: critique dtype: string - name: raw_output dtype: string splits: - name: train num_bytes: 9539790 num_examples: 3996 download_size: 4772223 dataset_size: 9539790 configs: - config_name: default data_files: - split: train path: data/train-* tags: - synthetic - distilabel - rlaif --- <p align="left"> <a href="https://github.com/argilla-io/distilabel"> <img src="https://raw.githubusercontent.com/argilla-io/distilabel/main/docs/assets/distilabel-badge-light.png" alt="Built with Distilabel" width="200" height="32"/> </a> </p> # Dataset Card for dpo-mix-4k-criticurus-temperature0.7-v0.0 This dataset has been created with [distilabel](https://distilabel.argilla.io/). ## Dataset Summary This dataset contains a `pipeline.yaml` which can be used to reproduce the pipeline that generated it in distilabel using the `distilabel` CLI: ```console distilabel pipeline run --config "https://huggingface.co/datasets/distilabel-internal-testing/dpo-mix-4k-criticurus-temperature0.7-v0.0/raw/main/pipeline.yaml" ``` or explore the configuration: ```console distilabel pipeline info --config "https://huggingface.co/datasets/distilabel-internal-testing/dpo-mix-4k-criticurus-temperature0.7-v0.0/raw/main/pipeline.yaml" ``` ## Dataset structure The examples have the following structure per configuration: <details><summary> Configuration: default </summary><hr> ```json { "critique": "You correctly identified the anagram \"dormatory\" as a place where students live. However, the term \"dormatory\" is not commonly used, and it seems like a typo. To improve, ensure that the word you provide is accurate and commonly used in the context given. For instance, you could have used \"dortory\" instead, which is still a form of the word \"dorm restaurant\" but is more logical and correctly speled.", "dataset_name": "argilla/distilabel-capybara-dpo-7k-binarized", "instruction": "A phrase that\u0027s an anagram of \"dirty room\", it refers to a place where students live.", "model_name": "distilabel-internal-testing/criticurus-v0.0", "rating": 5.0, "raw_output": null, "response": "dormitory", "score": "7\u003c|im_end|\u003e" } ``` This subset can be loaded as: ```python from datasets import load_dataset ds = load_dataset("distilabel-internal-testing/dpo-mix-4k-criticurus-temperature0.7-v0.0", "default") ``` Or simply as it follows, since there's only one configuration and is named `default`: ```python from datasets import load_dataset ds = load_dataset("distilabel-internal-testing/dpo-mix-4k-criticurus-temperature0.7-v0.0") ``` </details>

提供机构：

distilabel-internal-testing

5,000+

优质数据集

54 个

任务类型

进入经典数据集