yoonholee/math-corpus-combined
Source: Hugging Face. Updated 2026-03-06; indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/yoonholee/math-corpus-combined
Resource description:
---
license: apache-2.0
language:
- en
tags:
- math
- retrieval
- competition-math
- olympiad
size_categories:
- 100K<n<1M
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
- config_name: val_v4
  data_files:
  - split: val
    path: val_v4/val-*
dataset_info:
  config_name: val_v4
  features:
  - name: problem
    dtype: large_string
  - name: solution
    dtype: large_string
  - name: answer
    dtype: large_string
  - name: category
    dtype: large_string
  - name: domain
    dtype: large_string
  - name: source
    dtype: large_string
  - name: difficulty
    dtype: float64
  - name: problem_hash
    dtype: large_string
  - name: grading_guidelines
    dtype: large_string
  splits:
  - name: val
    num_bytes: 303236
    num_examples: 250
  download_size: 161575
  dataset_size: 303236
---
# Math Corpus Combined
535,356 competition math problems with solutions, combined from eight public datasets for retrieval-augmented math problem solving.
## Sources
| Dataset | Rows | HF Link |
|---------|-----:|---------|
| NuminaMath-1.5 | 129,520 | [AI-MO/NuminaMath-1.5](https://huggingface.co/datasets/AI-MO/NuminaMath-1.5) |
| OpenMathReasoning | 281,743 | [nvidia/OpenMathReasoning](https://huggingface.co/datasets/nvidia/OpenMathReasoning) |
| DeepMath-103K | 103,021 | [zwhe99/DeepMath-103K](https://huggingface.co/datasets/zwhe99/DeepMath-103K) |
| PolyMath | 11,083 | [AIMO-Corpus/PolyMath](https://huggingface.co/datasets/AIMO-Corpus/PolyMath) |
| Omni-MATH | 4,289 | [KbsdJames/Omni-MATH](https://huggingface.co/datasets/KbsdJames/Omni-MATH) |
| FineProofs-SFT | 4,275 | [SPIderman5/FineProofs-SFT](https://huggingface.co/datasets/SPIderman5/FineProofs-SFT) |
| AIME 1983-2024 | 933 | [gneubig/aime-1983-2024](https://huggingface.co/datasets/gneubig/aime-1983-2024) |
| Putnam-AXIOM | 492 | [Putnam-AXIOM/putnam-axiom-dataset-v1](https://huggingface.co/datasets/Putnam-AXIOM/putnam-axiom-dataset-v1) |
| **Total** | **535,356** | |
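
As a quick consistency check, the per-source row counts in the table above can be summed and compared with the stated total. The sketch below only re-enters the numbers from the table; the variable names are illustrative.

```python
# Row counts copied from the Sources table.
SOURCE_ROWS = {
    "NuminaMath-1.5": 129_520,
    "OpenMathReasoning": 281_743,
    "DeepMath-103K": 103_021,
    "PolyMath": 11_083,
    "Omni-MATH": 4_289,
    "FineProofs-SFT": 4_275,
    "AIME 1983-2024": 933,
    "Putnam-AXIOM": 492,
}

total = sum(SOURCE_ROWS.values())
largest = max(SOURCE_ROWS, key=SOURCE_ROWS.get)

# Print each source's share of the corpus, largest first.
for name, rows in sorted(SOURCE_ROWS.items(), key=lambda kv: -kv[1]):
    print(f"{name:20s} {rows:>8,d} {rows / total:6.1%}")
print(f"{'Total':20s} {total:>8,d}")
```

The sum comes to 535,356, matching the table's total, and OpenMathReasoning is the largest contributor at a little over half of all rows.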
Provider:
yoonholee



