decent-jawfish/test1
Dataset Card for test1
Dataset Summary
This dataset contains a pipeline.yaml file, which can be used with the distilabel CLI to reproduce the pipeline that generated the dataset:
console distilabel pipeline run --config "https://huggingface.co/datasets/decent-jawfish/test1/raw/main/pipeline.yaml"
or to explore the configuration:
console distilabel pipeline info --config "https://huggingface.co/datasets/decent-jawfish/test1/raw/main/pipeline.yaml"
Dataset Structure
The examples in each configuration have the following structure:
<details><summary> Configuration: default </summary><hr>
json {
  "completion": "Denote the number of chocolates each person has by the letter of their first name. We know that A = D + 12 D = R + 6 A = 2 * R\nThus, A = (R + 6) + 12 = R + 18 Since also A = 2 * R, this means 2 * R = R + 18 Hence R = 18 Hence D = 18 + 6 = 24",
  "generation": "Let x be the number of chocolates Robbie has. Danny has x + 6 chocolates. Arianna has 2x chocolates. 2x = x + 6 + 12 2*x = x + 18 x = 18 Danny has 18 + 6 = \u003c\u003c18+6=24\u003e\u003e24 chocolates. Answer: \\boxed{24}.",
  "instruction": "Arianna has 12 chocolates more than Danny. Danny has 6 chocolates more than Robbie. Arianna has twice as many chocolates as Robbie has. How many chocolates does Danny have?",
  "meta": {
    "category": "Question Answering",
    "completion": "Denote the number of chocolates each person has by the letter of their first name. We know that A = D + 12 D = R + 6 A = 2 * R\nThus, A = (R + 6) + 12 = R + 18 Since also A = 2 * R, this means 2 * R = R + 18 Hence R = 18 Hence D = 18 + 6 = 24",
    "id": 0,
    "input": null,
    "motivation_app": null,
    "prompt": "Arianna has 12 chocolates more than Danny. Danny has 6 chocolates more than Robbie. Arianna has twice as many chocolates as Robbie has. How many chocolates does Danny have?",
    "source": "surge",
    "subcategory": "Math"
  },
  "model_name": "gpt35_16k"
}
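The arithmetic in this sample record can be checked independently. The following is a minimal sketch and is not part of the dataset or its pipeline; the function name `solve_chocolates` is hypothetical and exists only for illustration.

```python
def solve_chocolates():
    """Solve A = D + 12, D = R + 6, A = 2 * R (hypothetical helper)."""
    # Substituting D = R + 6 into A = D + 12 gives A = R + 18.
    # Combined with A = 2 * R, this yields 2 * R = R + 18, so R = 18.
    r = 6 + 12
    d = r + 6
    a = d + 12
    # Confirm that all three constraints from the instruction hold.
    assert a == d + 12 and d == r + 6 and a == 2 * r
    return {"Arianna": a, "Danny": d, "Robbie": r}


print(solve_chocolates())
```

The value for Danny agrees with the answer given in both the `completion` and `generation` fields.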
This subset can be loaded as:
python from datasets import load_dataset
ds = load_dataset("decent-jawfish/test1", "default")
Or simply as follows, since there is only one configuration and it is named default:
python from datasets import load_dataset
ds = load_dataset("decent-jawfish/test1")
</details>
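One possible use of these fields is to compare the final answer in `generation` with the reference in `completion`. The sketch below assumes the generated answer is wrapped in `\boxed{...}` and that the reference ends with its numeric result, as in the sample record; the helper names `extract_boxed_answer` and `extract_final_number` are hypothetical, and the record is an abridged copy of the sample rather than data loaded from the Hub.

```python
import re


def extract_boxed_answer(generation):
    # Return the content of the last \boxed{...} expression, or None.
    matches = re.findall(r"\\boxed\{([^{}]*)\}", generation)
    return matches[-1].strip() if matches else None


def extract_final_number(text):
    # Return the last number that appears in the text, or None.
    matches = re.findall(r"-?\d+(?:\.\d+)?", text)
    return matches[-1] if matches else None


# Abridged copy of the sample record (only the tail of each field).
record = {
    "completion": "Hence R = 18 Hence D = 18 + 6 = 24",
    "generation": "Danny has 18 + 6 = <<18+6=24>>24 chocolates. Answer: \\boxed{24}.",
}

predicted = extract_boxed_answer(record["generation"])
reference = extract_final_number(record["completion"])
print(predicted == reference)
```

Exact string comparison is a deliberately simple choice here; answers with units or different number formatting would need normalization first.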



