nnmax/OpenMathReasoning

Source: Hugging Face. Updated 2025-12-15; indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/nnmax/OpenMathReasoning
Resource summary:
OpenMathReasoning is a large-scale math reasoning dataset for training large language models (LLMs). It contains 306K unique mathematical problems sourced from AoPS forums, together with 3.2M long chain-of-thought (CoT) solutions, 1.7M long tool-integrated reasoning (TIR) solutions, and 566K GenSelect samples, each of which selects the most promising solution out of many candidates. An additional 193K problems sourced from AoPS forums (problems only, no solutions) are also included. This dataset was a foundation of our winning submission to the AIMO-2 Kaggle competition. It also includes all the data used to train the OpenMath-Nemotron series of models.
Provider:
nnmax
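
The dataset described above can be previewed without downloading it in full. The following is a minimal sketch, not an official loading recipe. It assumes the Hugging Face `datasets` library is installed, that the mirror in the download link is reachable through the `HF_ENDPOINT` environment variable, and that the repository exposes a split named `cot`. The split name and the helper `peek` are assumptions for illustration; the summary does not list split or column names.

```python
import os
from itertools import islice

# Assumption: route Hub traffic through the mirror named in the download link.
# HF_ENDPOINT must be set before `datasets` / `huggingface_hub` is imported.
os.environ.setdefault("HF_ENDPOINT", "https://hf-mirror.com")

REPO_ID = "nnmax/OpenMathReasoning"


def peek(split="cot", n=3):
    """Return the first n records of a split without a full download.

    The default split name "cot" is an assumption; check the dataset
    page for the splits that actually exist.
    """
    # Imported lazily so that HF_ENDPOINT is already set at import time.
    from datasets import load_dataset

    # streaming=True yields records lazily over the network.
    stream = load_dataset(REPO_ID, split=split, streaming=True)
    return list(islice(stream, n))
```

Calling `peek()` and printing `sorted(record.keys())` for each returned record is a quick way to discover the column names before committing to a full download of several million solutions.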