Game-of-24
收藏arXiv2025-09-30 收录
下载链接:
https://en.wikipedia.org/wiki/24_(puzzle)
下载链接
链接失效反馈官方服务:
资源简介:
该数据集旨在通过使用基本的算术运算将四个输入数字结合起来,以达到数字24的目标。每个输入数字只能使用一次,评估过程采用基于过程验证器来检查中间步骤。该数据集规模包括900个训练案例和100个测试案例,任务类型为数学推理。
This dataset is designed to combine four input numbers using basic arithmetic operations to reach the target number 24. Each input number may only be utilized exactly once. The evaluation process employs a process-based validator to inspect the intermediate steps. The dataset includes 900 training instances and 100 test instances, and the task belongs to the category of mathematical reasoning.



