five

LinhIcey/mathematics_competition

收藏
Hugging Face2026-04-09 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/LinhIcey/mathematics_competition
下载链接
链接失效反馈
官方服务:
资源简介:
--- language: - en - zh license: apache-2.0 task_categories: - text-generation tags: - math - competition - evaluation size_categories: - 1K<n<10K configs: - config_name: default data_files: - split: test path: data/competition_math_stream.jsonl --- # Mathematics Competition Evaluation Competition-level mathematics evaluation dataset with 3-run predictions from Gemini model. ## Dataset Structure Each row contains: - `uuid`: unique identifier - `question`: math competition problem - `answer`: ground truth answer - `source`: problem source - `run_0`, `run_1`, `run_2`: each a dict with: - `prediction`: model's answer - `stream_output`: list of stream output segments - `stream_output_kinds`: list of output kinds (thought/text/tool_call) - `correct`: whether prediction matches answer - `reserve`: `True` if at least one run is correct, `False` if all three are wrong ## Statistics - **Total**: 3625 problems (with all 3 runs completed) - **reserve=True**: 3165 (87.3%) - **reserve=False**: 460 (12.7%)
提供机构:
LinhIcey
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作