imo-2025
收藏魔搭社区2026-01-06 更新2025-07-19 收录
下载链接:
https://modelscope.cn/datasets/lmms-lab/imo-2025
下载链接
链接失效反馈官方服务:
资源简介:
# IMO 2025 Problems Dataset
This dataset contains the 6 problems from the 2025 International Mathematical Olympiad (IMO). The problems are formatted with proper LaTeX notation for mathematical expressions.
## Dataset Structure
Each example contains:
- `id`: Problem identifier (e.g., "2025-imo-p1")
- `problem`: The problem statement with LaTeX mathematical notation
- `solution`: The solution (currently set to null)
## Problem Types
The dataset includes problems covering various mathematical areas:
1. **Problem 1**: Combinatorial geometry (sunny lines)
2. **Problem 2**: Euclidean geometry (circles and triangles)
3. **Problem 3**: Number theory (bonza functions)
4. **Problem 4**: Number theory (proper divisors and sequences)
5. **Problem 5**: Game theory (inekoalaty game)
6. **Problem 6**: Combinatorial geometry (grid tiling)
## Mathematical Notation
Mathematical expressions are formatted using LaTeX:
- Variables and expressions: `$x$`, `$n \geq 3$`
- Display equations: `$$f(a) \text{ divides } b^a - f(b)^{f(a)}$$`
- Sets: `$\mathbb{N}$`, `$\mathbb{R}$`
- Special formatting: *sunny*, *bonza*, *proper divisor*, *inekoalaty game*
## Files
- `imo_2025.json`: Full dataset in JSON format
- `README.md`: This file
## Usage
```python
from datasets import load_dataset
# Load the dataset
dataset = load_dataset("lmms-lab/imo-2025")
# Access individual problems
for problem in dataset['train']:
print(f"Problem: {problem['id']}")
print(f"Statement: {problem['problem']}")
print()
```
Or load directly from JSON:
```python
import json
# Load from JSON
with open("imo_2025.json", "r") as f:
problems = json.load(f)
# Access problems
for problem in problems:
print(f"ID: {problem['id']}")
print(f"Problem: {problem['problem']}")
print(f"Solution: {problem['solution']}")
print()
```
## Citation
If you use this dataset in your research, please cite:
```bibtex
@dataset{imo2025,
title={IMO 2025 Problems Dataset},
author={LMMS Lab},
year={2025},
url={https://huggingface.co/datasets/lmms-lab/imo-2025}
}
```
## Source
Problems are from the 2025 International Mathematical Olympiad.
Original source: https://www.imo-official.org/problems.aspx
## License
This dataset is released under the MIT License.
# 2025年国际数学奥林匹克(International Mathematical Olympiad, IMO)试题数据集
本数据集收录了2025年国际数学奥林匹克的全部6道试题,所有试题均采用标准LaTeX语法编排数学表达式。
## 数据集结构
每个数据样本包含以下字段:
- `id`: 试题标识符(例如:"2025-imo-p1")
- `problem`: 带有LaTeX数学标记的试题题干
- `solution`: 试题解答(当前设置为空值)
## 试题分类
本数据集涵盖多个数学分支的试题:
1. **第1题**:组合几何(阳光直线(sunny lines))
2. **第2题**:欧几里得几何(圆与三角形问题)
3. **第3题**:数论(邦扎函数(bonza functions))
4. **第4题**:数论(真约数与序列(proper divisors and sequences))
5. **第5题**:博弈论(伊诺科拉蒂博弈(inekoalaty game))
6. **第6题**:组合几何(网格铺砌(grid tiling))
## 数学标记规范
数学表达式采用LaTeX格式编排:
- 变量与表达式:`$x$`、`$n geq 3$`
- 行间公式:`$$f(a) ext{ 整除 } b^a - f(b)^{f(a)}$$`
- 数集:`$mathbb{N}$`、`$mathbb{R}$`
- 特殊格式标注:*sunny*、*bonza*、*proper divisor*、*inekoalaty game*
## 数据文件
- `imo_2025.json`: 完整的JSON格式数据集文件
- `README.md`: 本说明文档
## 使用方法
### 通过Hugging Face Datasets库加载
python
from datasets import load_dataset
# 加载数据集
dataset = load_dataset("lmms-lab/imo-2025")
# 遍历访问单个试题
for problem in dataset['train']:
print(f"试题编号:{problem['id']}")
print(f"试题题干:{problem['problem']}")
print()
### 直接从JSON文件加载
python
import json
# 读取JSON文件
with open("imo_2025.json", "r", encoding="utf-8") as f:
problems = json.load(f)
# 遍历访问试题
for problem in problems:
print(f"试题编号:{problem['id']}")
print(f"试题题干:{problem['problem']}")
print(f"试题解答:{problem['solution']}")
print()
## 引用声明
若您在研究中使用本数据集,请引用如下BibTeX条目:
bibtex
@dataset{imo2025,
title={IMO 2025 Problems Dataset},
author={LMMS Lab},
year={2025},
url={https://huggingface.co/datasets/lmms-lab/imo-2025}
}
## 数据集来源
本数据集的试题来自2025年国际数学奥林匹克,原始来源:https://www.imo-official.org/problems.aspx
## 开源许可
本数据集采用MIT许可协议发布。
提供机构:
maas
创建时间:
2025-07-18
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集收录了2025年国际数学奥林匹克竞赛(IMO)的6道题目,问题陈述使用规范的LaTeX数学符号表示,覆盖组合几何、数论和博弈论等多个数学领域。数据集以JSON格式提供,便于直接加载和使用,但当前解决方案字段为空,主要适用于数学问题求解或竞赛研究场景。
以上内容由遇见数据集搜集并总结生成



