five

AoPS-Instruct

收藏
arXiv2025-09-30 收录
下载链接:
https://github.com/DSL-Lab/aops
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了从“艺术解决问题”论坛中提取的超过60万对高质量的问题与答案,旨在提升大型语言模型在推理能力上的表现。此外,该数据集中还包含了从常用的数学基准测试中清洗过的数据,以避免与现有数据重叠。规模上,该数据集超过了60万对问题与答案,任务则是针对奥林匹克级别的数学问题,对大型语言模型进行训练和评估。

This dataset contains over 600,000 high-quality question-answer pairs extracted from the Art of Problem Solving (AoPS) forum, with the objective of enhancing the reasoning capabilities of large language models (LLMs). Additionally, it includes curated data cleaned from common mathematical benchmarks to avoid overlap with existing datasets. In terms of scale, the dataset has more than 600,000 question-answer pairs, and the tasks are designed to train and evaluate LLMs on Olympiad-level mathematical problems.
提供机构:
Art of Problem Solving forum
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作