elkins_dataset_trial
收藏数据集概述:elkins_dataset_trial
基本信息
- 许可证: MIT
- 数据规模: 小于1K
- 标签: synthetic, distilabel, rlaif
数据集结构
- 配置名称: default
- 数据文件:
- 训练集: data/train-*
- 特征:
- instruction: string
- generation: string
- distilabel_metadata:
- raw_input_text_generation_0:
- content: string
- role: string
- raw_output_text_generation_0: string
- statistics_text_generation_0:
- input_tokens: int64
- output_tokens: int64
- raw_input_text_generation_0:
- model_name: string
数据集统计
- 训练集:
- 字节数: 33774
- 样本数: 10
- 下载大小: 39619
- 数据集大小: 33774
数据集加载方式
python from datasets import load_dataset ds = load_dataset("elkinsqiu/elkins_dataset_trial", "default")
或 python from datasets import load_dataset ds = load_dataset("elkinsqiu/elkins_dataset_trial")
数据集生成
- 生成工具: distilabel
- 配置文件: pipeline.yaml console distilabel pipeline run --config "https://huggingface.co/datasets/elkinsqiu/elkins_dataset_trial/raw/main/pipeline.yaml"
或 console distilabel pipeline info --config "https://huggingface.co/datasets/elkinsqiu/elkins_dataset_trial/raw/main/pipeline.yaml"
示例数据
json { "distilabel_metadata": { "raw_input_text_generation_0": [ { "content": "Arianna has 12 chocolates more than Danny. Danny has 6 chocolates more than Robbie. Arianna has twice as many chocolates as Robbie has. How many chocolates does Danny have?", "role": "user" } ], "raw_output_text_generation_0": "Lets say Robbie has x chocolates.
Since Danny has 6 chocolates more than Robbie, Danny has x + 6 chocolates.
Arianna has 12 chocolates more than Danny, so Arianna has x + 6 + 12 = x + 18 chocolates.
Since Arianna has twice as many chocolates as Robbie, we can say that: x + 18 = 2x (because Arianna has twice as many chocolates as Robbie)
Now, lets solve for x: x + 18 = 2x 18 = 2x - x 18 = x x = 18
Now that we know Robbie has 18 chocolates, we can find out how many Danny has: Danny has x + 6 chocolates Danny has 18 + 6 = 24 chocolates.
So Danny has 24 chocolates.", "statistics_text_generation_0": { "input_tokens": 73, "output_tokens": 172 } }, "generation": "Lets say Robbie has x chocolates.
Since Danny has 6 chocolates more than Robbie, Danny has x + 6 chocolates.
Arianna has 12 chocolates more than Danny, so Arianna has x + 6 + 12 = x + 18 chocolates.
Since Arianna has twice as many chocolates as Robbie, we can say that: x + 18 = 2x (because Arianna has twice as many chocolates as Robbie)
Now, lets solve for x: x + 18 = 2x 18 = 2x - x 18 = x x = 18
Now that we know Robbie has 18 chocolates, we can find out how many Danny has: Danny has x + 6 chocolates Danny has 18 + 6 = 24 chocolates.
So Danny has 24 chocolates.", "instruction": "Arianna has 12 chocolates more than Danny. Danny has 6 chocolates more than Robbie. Arianna has twice as many chocolates as Robbie has. How many chocolates does Danny have?", "model_name": "meta-llama/Llama-3.2-3B-Instruct" }




