peiyi9979/Math-Shepherd
Dataset Card: Math-Shepherd
Data Loading
```python
from datasets import load_dataset

dataset = load_dataset("peiyi9979/Math-Shepherd")
```
Data Instances
Each instance contains three data fields: "input", "label", and "task".
- "input": the question followed by a step-by-step solution, for example:
If Buzz bought a pizza with 78 slices at a restaurant and then decided to share it with the waiter in the ratio of 5:8, with Buzzs ratio being 5, whats twenty less the number of slices of pizza that the waiter ate?
Step 1: The total ratio representing the pizza is 5+8 = <<5+8=13>>13. ки
Step 2: The waiter ate 13 x 8 / 13 = <<13*8/13=6>>6 slices of the pizza. ки
Step 3: Buzz ate 78 - 6 = <<78-6=72>>72 slices of the pizza. ки
Step 4: The waiter ate 20 less than the number of slices that Buzz ate which is 72 - 20 = 52. ки
Step 5: The waiter ate 52 slices of the pizza. The answer is: 52 ки
- "label": the question followed by the same step-by-step solution with automatic step labels, for example:
If Buzz bought a pizza with 78 slices at a restaurant and then decided to share it with the waiter in the ratio of 5:8, with Buzzs ratio being 5, whats twenty less the number of slices of pizza that the waiter ate?
Step 1: The total ratio representing the pizza is 5+8 = <<5+8=13>>13. +
Step 2: The waiter ate 13 x 8 / 13 = <<13*8/13=6>>6 slices of the pizza. -
Step 3: Buzz ate 78 - 6 = <<78-6=72>>72 slices of the pizza. -
Step 4: The waiter ate 20 less than the number of slices that Buzz ate which is 72 - 20 = 52. -
Step 5: The waiter ate 52 slices of the pizza. The answer is: 52 -
- "task": either GSM8K or MATH.
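The relationship between the "input" and "label" fields can be illustrated with a short sketch. It assumes, based only on the example shown here, that "label" is identical to "input" except that each "ки" marker is replaced by a single "+" or "-" character. The function name `step_labels` is hypothetical and not part of the dataset or any library.

```python
STEP_TAG = "ки"  # marker that ends every step in the "input" field


def step_labels(input_text: str, label_text: str) -> list[str]:
    """Read the '+' or '-' that replaces each step tag in label_text.

    Assumption: label_text equals input_text with every STEP_TAG
    replaced by exactly one character, '+' or '-'.
    """
    labels = []
    pos = 0  # current index into label_text
    segments = input_text.split(STEP_TAG)
    for segment in segments[:-1]:  # text after the last tag carries no label
        pos += len(segment)
        mark = label_text[pos]
        if mark not in ("+", "-"):
            raise ValueError(f"unexpected character {mark!r} at index {pos}")
        labels.append(mark)
        pos += 1  # the tag is two characters in input, one in label
    return labels


# First two steps of the example instance, question omitted for brevity.
input_text = (
    "Step 1: The total ratio representing the pizza is 5+8 = <<5+8=13>>13. ки\n"
    "Step 2: The waiter ate 13 x 8 / 13 = <<13*8/13=6>>6 slices of the pizza. ки"
)
label_text = (
    "Step 1: The total ratio representing the pizza is 5+8 = <<5+8=13>>13. +\n"
    "Step 2: The waiter ate 13 x 8 / 13 = <<13*8/13=6>>6 slices of the pizza. -"
)
labels = step_labels(input_text, label_text)
print(labels)
```

Indexing by position rather than searching for "+" or "-" avoids confusing a step label with an operator inside the solution text, such as the "5+8" in Step 1.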
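Notes:
- "ки" is a special token that marks the position where the step score is predicted.
- "+" marks a good step, meaning one that has the potential to lead to the correct answer.
- "-" marks a bad step.
- When training process reward models (PRMs), we compute the loss only at the "ки" positions.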
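Restricting the loss to the "ки" positions can be sketched as building a target sequence that is ignored everywhere else. This is a minimal illustration and not the authors' training code. The function `build_targets`, the toy token ids, and the choice of `good_id` and `bad_id` are all hypothetical. The value -100 is used because it is the default `ignore_index` of PyTorch's `CrossEntropyLoss`; other frameworks may use a different convention, and whether targets must be shifted by one position depends on the training setup.

```python
IGNORE_INDEX = -100  # assumption: the loss function skips targets with this value


def build_targets(token_ids, step_tag_id, step_labels, good_id, bad_id):
    """Return per-token targets that are only set at step-tag positions.

    token_ids   : tokenized "input" text (toy ids in this sketch)
    step_tag_id : id of the step-tag token (hypothetical value below)
    step_labels : one '+' or '-' per step, in order
    good_id     : target id used for a '+' step (hypothetical)
    bad_id      : target id used for a '-' step (hypothetical)
    """
    targets = [IGNORE_INDEX] * len(token_ids)
    tag_positions = [i for i, tok in enumerate(token_ids) if tok == step_tag_id]
    if len(tag_positions) != len(step_labels):
        raise ValueError("number of step tags and step labels differ")
    for pos, label in zip(tag_positions, step_labels):
        targets[pos] = good_id if label == "+" else bad_id
    return targets


# Toy sequence with two steps; 99 stands in for the step-tag token id.
token_ids = [11, 12, 99, 13, 14, 15, 99]
targets = build_targets(
    token_ids, step_tag_id=99, step_labels=["+", "-"], good_id=1, bad_id=0
)
print(targets)
```

Every position that is not a step tag keeps `IGNORE_INDEX`, so only the two tagged positions would contribute to the loss.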




