distilabel-example
Dataset Overview
Basic Information
- Dataset name: distilabel-example
- Created with: distilabel
- Dataset size: 26,140 bytes
- Download size: 30,754 bytes
- Number of examples: 10
- Size category: n<1K
- Tags: synthetic, distilabel, rlaif
Dataset Structure
Features
- instruction: string
- generation: string
- distilabel_metadata:
  - raw_input_text_generation_0 (list of messages):
    - content: string
    - role: string
  - raw_output_text_generation_0: string
  - statistics_text_generation_0:
    - input_tokens: int64
    - output_tokens: int64
- model_name: string
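The nested feature layout can be mirrored as Python type hints, which makes it easier to check that a record has the expected shape. This is a minimal sketch using only the standard library; the class names (`ChatMessage`, `TokenStatistics`, `DistilabelMetadata`, `Record`) and the `missing_fields` helper are hypothetical and are not part of distilabel or `datasets`.

```python
from typing import List, TypedDict


class ChatMessage(TypedDict):
    content: str
    role: str


class TokenStatistics(TypedDict):
    input_tokens: int
    output_tokens: int


class DistilabelMetadata(TypedDict):
    raw_input_text_generation_0: List[ChatMessage]
    raw_output_text_generation_0: str
    statistics_text_generation_0: TokenStatistics


class Record(TypedDict):
    instruction: str
    generation: str
    distilabel_metadata: DistilabelMetadata
    model_name: str


def missing_fields(record: dict) -> list:
    """Return the expected top-level and metadata fields absent from a record."""
    missing = [key for key in Record.__annotations__ if key not in record]
    meta = record.get("distilabel_metadata", {})
    for key in DistilabelMetadata.__annotations__:
        if key not in meta:
            missing.append(f"distilabel_metadata.{key}")
    return missing
```

An empty list from `missing_fields(row)` means the row carries every field named in the feature list; the helper checks presence only, not value types.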
Data Splits
- train:
  - Bytes: 26,140
  - Examples: 10
Configuration
- Configuration name: default
- Data files:
  - Split: train
  - Path: data/train-*
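The data file path `data/train-*` is a glob pattern rather than a single file name. The sketch below illustrates how such a pattern selects files, using the standard library's `fnmatchcase`. The candidate file names are hypothetical examples, not a listing of the repository, and the Hub's own pattern resolution may differ in detail from `fnmatch`.

```python
from fnmatch import fnmatchcase

PATTERN = "data/train-*"

# Hypothetical file names, for illustration only.
candidates = [
    "data/train-00000-of-00001.parquet",
    "data/test-00000-of-00001.parquet",
    "README.md",
]

# Keep only the names that the train pattern would select.
matched = [name for name in candidates if fnmatchcase(name, PATTERN)]
print(matched)
```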
Loading

    from datasets import load_dataset
    ds = load_dataset("davidmeikle/distilabel-example", "default")

Or, omitting the configuration name:

    from datasets import load_dataset
    ds = load_dataset("davidmeikle/distilabel-example")

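Once loaded, each row behaves like a dictionary with the fields listed under the dataset structure. As a rough sketch, the token statistics stored in `distilabel_metadata` can be summed across rows. The helper names `summarize_tokens` and `load_rows` are hypothetical; `load_rows` needs network access and the `datasets` package, so the demonstration at the bottom runs offline on a single hand-written row that reuses the token counts from the dataset example.

```python
from typing import Dict, Iterable


def summarize_tokens(rows: Iterable[dict]) -> Dict[str, int]:
    """Sum the input/output token counts recorded in distilabel_metadata."""
    totals = {"rows": 0, "input_tokens": 0, "output_tokens": 0}
    for row in rows:
        stats = row["distilabel_metadata"]["statistics_text_generation_0"]
        totals["rows"] += 1
        totals["input_tokens"] += stats["input_tokens"]
        totals["output_tokens"] += stats["output_tokens"]
    return totals


def load_rows():
    """Fetch the train split from the Hub (needs network and `datasets`)."""
    from datasets import load_dataset

    return load_dataset("davidmeikle/distilabel-example", split="train")


# Offline demonstration with one row shaped like the dataset's records.
sample = [
    {
        "distilabel_metadata": {
            "statistics_text_generation_0": {"input_tokens": 39, "output_tokens": 180}
        }
    }
]
print(summarize_tokens(sample))
```

With network access, `summarize_tokens(load_rows())` would apply the same summary to the full train split.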
Dataset Example

    {
        "distilabel_metadata": {
            "raw_input_text_generation_0": [
                {
                    "content": "Arianna has 12 chocolates more than Danny. Danny has 6 chocolates more than Robbie. Arianna has twice as many chocolates as Robbie has. How many chocolates does Danny have?",
                    "role": "user"
                }
            ],
            "raw_output_text_generation_0": "Lets break it down step by step:\n- Lets say Robbie has x chocolates.\n- Since Arianna has twice as many chocolates as Robbie, Arianna has 2x chocolates.\n- Danny has 6 chocolates more than Robbie, so Danny has x + 6 chocolates.\n- But we know that Arianna has 12 chocolates more than Danny, which means: 2x = (x + 6) + 12\n- Simplify the equation: 2x = x + 18\n- Subtract x from both sides: x = 18\nNow that we know Robbie has 18 chocolates, we can find out how many chocolates Danny has: Danny has x + 6 chocolates Danny has 18 + 6 chocolates Danny has 24 chocolates.\nSo, Danny has 24 chocolates.",
            "statistics_text_generation_0": {
                "input_tokens": 39,
                "output_tokens": 180
            }
        },
        "generation": "Lets break it down step by step:\n- Lets say Robbie has x chocolates.\n- Since Arianna has twice as many chocolates as Robbie, Arianna has 2x chocolates.\n- Danny has 6 chocolates more than Robbie, so Danny has x + 6 chocolates.\n- But we know that Arianna has 12 chocolates more than Danny, which means: 2x = (x + 6) + 12\n- Simplify the equation: 2x = x + 18\n- Subtract x from both sides: x = 18\nNow that we know Robbie has 18 chocolates, we can find out how many chocolates Danny has: Danny has x + 6 chocolates Danny has 18 + 6 chocolates Danny has 24 chocolates.\nSo, Danny has 24 chocolates.",
        "instruction": "Arianna has 12 chocolates more than Danny. Danny has 6 chocolates more than Robbie. Arianna has twice as many chocolates as Robbie has. How many chocolates does Danny have?",
        "model_name": "unsloth/Llama-3.2-3B-Instruct"
    }

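The answer in the example record can be checked independently of the model's algebra. The sketch below searches small non-negative integers for values that satisfy the three relations stated in the instruction; the function name `solve_chocolates` and the search bound are assumptions made for illustration.

```python
def solve_chocolates(max_count: int = 1000):
    """Search non-negative integers satisfying the three stated relations."""
    for robbie in range(max_count + 1):
        danny = robbie + 6         # Danny has 6 more than Robbie
        arianna = danny + 12       # Arianna has 12 more than Danny
        if arianna == 2 * robbie:  # Arianna has twice Robbie's count
            return {"robbie": robbie, "danny": danny, "arianna": arianna}
    return None


print(solve_chocolates())
```

The search agrees with the generated answer: Robbie has 18 chocolates, Danny has 24, and Arianna has 36.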
Reproduce and Explore
To reproduce the pipeline that generated this dataset, run:

    distilabel pipeline run --config "https://huggingface.co/datasets/davidmeikle/distilabel-example/raw/main/pipeline.yaml"

Or explore the configuration:

    distilabel pipeline info --config "https://huggingface.co/datasets/davidmeikle/distilabel-example/raw/main/pipeline.yaml"




