five

MathArena/arxivlean-0326

收藏
Hugging Face2026-04-21 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/MathArena/arxivlean-0326
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: problem_idx dtype: int64 - name: problem dtype: string - name: answer dtype: string - name: formal_statement dtype: string - name: source dtype: string - name: title dtype: string - name: authors dtype: string splits: - name: train num_bytes: 68093 num_examples: 41 download_size: 46955 dataset_size: 68093 configs: - config_name: default data_files: - split: train path: data/train-* license: cc-by-sa-4.0 language: - en pretty_name: ArXivLean March 2026 size_categories: - n<1K --- ### Homepage and repository - **Homepage:** [https://matharena.ai/](https://matharena.ai/) - **Repository:** [https://github.com/eth-sri/matharena](https://github.com/eth-sri/matharena) ### Dataset Summary This dataset contains the questions from ArXivLean March 2026 used for the MathArena Leaderboard ### Data Fields Below one can find the description of each field in the dataset. - `problem_idx` (int): Index of the problem in the competition - `problem` (str): Full problem statement - `formal_statement` (str): Formal statement - `source` (str): Source paper of the statement - `title` (str): Title of the source paper - `authors` (str): Authors of the paper. ### Licensing Information This dataset is licensed under the Attribution-ShareAlike 4.0 International (CC BY-SA 4.0). Please abide by the license when using the provided data. ### Citation Information ``` @misc{balunovic_srimatharena_2025, title = {MathArena: Evaluating LLMs on Uncontaminated Math Competitions}, author = {Mislav Balunović and Jasper Dekoninck and Ivo Petrov and Nikola Jovanović and Martin Vechev}, copyright = {MIT}, url = {https://matharena.ai/}, publisher = {SRI Lab, ETH Zurich}, month = feb, year = {2025}, } ```
提供机构:
MathArena
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作