AQUA
Source: arXiv, indexed 2025-09-30
Download link:
https://github.com/noagarcia/artvqa
Overview:
This dataset is designed for arithmetic reasoning and contains a variety of mathematical word problems. It is used to evaluate the performance of large language models on mathematical reasoning tasks.



