UGMathBench
Source: ModelScope community. Updated: 2025-08-26. Indexed: 2025-03-29.
Download link:
https://modelscope.cn/datasets/xinxu02/UGMathBench
Description:
UGMathBench is a diverse and dynamic benchmark designed to evaluate the undergraduate-level mathematical reasoning of large language models (LLMs). It comprises 5,062 problems across 16 subjects and 111 topics, with 10 distinct answer types. Each problem comes in three randomized versions.
Provider:
maas
Created:
2025-03-25



