MathEval
Source: arXiv. Indexed 2025-09-30.
Download link:
https://github.com/math-eval/matheval
Description:
MathEval is a benchmark dataset for evaluating mathematical language models. It is designed as part of an evaluation benchmark for SLMs (Specific Language Models) in the mathematical domain. The dataset varies in scale, and all of its tasks target the evaluation of mathematical language models.



