AoPS-Instruct
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/DSL-Lab/aops
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了从“艺术解决问题”论坛中提取的超过60万对高质量的问题与答案,旨在提升大型语言模型在推理能力上的表现。此外,该数据集中还包含了从常用的数学基准测试中清洗过的数据,以避免与现有数据重叠。规模上,该数据集超过了60万对问题与答案,任务则是针对奥林匹克级别的数学问题,对大型语言模型进行训练和评估。
This dataset contains over 600,000 high-quality question-answer pairs extracted from the Art of Problem Solving (AoPS) forum, with the objective of enhancing the reasoning capabilities of large language models (LLMs). Additionally, it includes curated data cleaned from common mathematical benchmarks to avoid overlap with existing datasets. In terms of scale, the dataset has more than 600,000 question-answer pairs, and the tasks are designed to train and evaluate LLMs on Olympiad-level mathematical problems.
提供机构:
Art of Problem Solving forum



