lmr-123/DeepSeek-ProverBench
收藏Hugging Face2025-12-15 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/lmr-123/DeepSeek-ProverBench
下载链接
链接失效反馈官方服务:
资源简介:
ProverBench是一个基准数据集,包含325个问题。其中15个问题来自最近的AIME竞赛(AIME 24和25),涉及数论和代数问题,提供了真实的高中竞赛级别挑战。其余310个问题来自精选的教科书示例和教育教程,涵盖了数论、初等代数、线性代数、抽象代数、微积分、实分析、复分析、泛函分析和概率等多个数学领域。该数据集旨在为高中竞赛问题和本科级数学问题提供更全面的评估。
ProverBench is a benchmark dataset comprising 325 problems. Of these, 15 are formalized from number theory and algebra questions featured in the recent AIME competitions (AIME 24 and 25), offering authentic high-school competition-level challenges. The remaining 310 problems are drawn from curated textbook examples and educational tutorials, contributing a diverse and pedagogically grounded collection of formalized mathematical problems. This benchmark is designed to enable more comprehensive evaluation across both high-school competition problems and undergraduate-level mathematics.
提供机构:
lmr-123



