omega-500

Name: omega-500
Creator: maas
Published: 2025-12-05 16:42:10
License: 暂无描述

魔搭社区2025-12-05 更新2025-07-19 收录

下载链接：

https://modelscope.cn/datasets/allenai/omega-500

下载链接

链接失效反馈

官方服务：

资源简介：

# Omega-500: Random Sample of Mathematical Problems This dataset contains a random sample of 500 mathematical problems selected from the comprehensive OMEGA problem families dataset. It provides a diverse, manageable subset for quick evaluation and experimentation across multiple mathematical domains and difficulty levels. ## Overview Omega-500 is designed for: - **Quick Evaluation**: Fast assessment of model capabilities across math domains - **Prototyping**: Testing new approaches before scaling to larger datasets - **Benchmarking**: Standardized subset for fair model comparisons - **Research**: Focused analysis on a balanced mathematical problem set The sample maintains diversity across mathematical domains and difficulty levels while keeping the dataset size manageable for rapid iteration. ## Quick Start ```python from datasets import load_dataset # Load the Omega-500 sample dataset = load_dataset("allenai/omega-500") problems = dataset["train"] # Access individual problems first_problem = problems[0] print("Problem:", first_problem["messages"][0]["content"]) print("Answer:", first_problem["ground_truth"]) print("Family:", first_problem["family"]) print("Difficulty:", first_problem["difficulty_level"]) ``` ## Dataset Composition ### Total Problems: 500 ### Domain Distribution: - **Algebra**: 92 problems (18.4%) - **Arithmetic**: 173 problems (34.6%) - **Combinatorics**: 84 problems (16.8%) - **Geometry**: 45 problems (9.0%) - **Logic**: 61 problems (12.2%) - **Number Theory**: 45 problems (9.0%) ## Data Fields Each problem contains: - `id`: Unique identifier for this sample - `original_id`: Original identifier from source dataset - `family`: Problem family (e.g., "algebra_func_area") - `difficulty_level`: Numeric difficulty level from source - `source_family`: Source family directory name - `source_level`: Source difficulty level name - `messages`: Problem statement in chat format - `ground_truth`: Correct answer - `dataset`: Dataset identifier ("OMEGA_500_SAMPLE") ## Citation If you use this dataset, please cite the original OMEGA work: ```bibtex @article{sun2024omega, title = {OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization}, author = {Yiyou Sun and Shawn Hu and Georgia Zhou and Ken Zheng and Hannaneh Hajishirzi and Nouha Dziri and Dawn Song}, journal = {arXiv preprint arXiv:2506.18880}, year = {2024}, } ``` ## Related Resources - **Full Problem Families**: See [omega-problems](https://huggingface.co/datasets/allenai/omega-problems) for the complete dataset - **Explorative Dataset**: See [omega-explorative](https://huggingface.co/datasets/allenai/omega-explorative) for explorative reasoning challenges - **Compositional Dataset**: See [omega-compositional](https://huggingface.co/datasets/allenai/omega-compositional) for compositional reasoning challenges - **Transformative Dataset**: See [omega-transformative](https://huggingface.co/datasets/allenai/omega-transformative) for transformative reasoning challenges - **Paper**: See the full details in [paper](https://arxiv.org/pdf/2506.18880) - **Code Repository**: See generation code on [github](https://github.com/sunblaze-ucb/math_ood)

# Omega-500：数学问题随机抽样数据集本数据集从完整的OMEGA问题家族数据集（OMEGA problem families dataset）中随机抽取500道数学问题构建而成，提供了一个多样化且规模可控的子集，可用于跨多个数学领域与难度层级的快速评估与实验研究。 ## 概述 Omega-500的设计用途包括： - **快速评估**：快速评估模型在各数学领域的能力 - **原型开发**：在扩展至更大规模数据集前测试新方法 - **基准测试**：用于公平模型对比的标准化子集 - **研究工作**：针对均衡数学问题集开展聚焦性分析该抽样样本在保持数学领域与难度层级多样性的同时，将数据集规模控制在便于快速迭代的范围内。 ## 快速入门 python from datasets import load_dataset # 加载Omega-500抽样数据集 dataset = load_dataset("allenai/omega-500") problems = dataset["train"] # 访问单个问题 first_problem = problems[0] print("Problem:", first_problem["messages"][0]["content"]) print("Answer:", first_problem["ground_truth"]) print("Family:", first_problem["family"]) print("Difficulty:", first_problem["difficulty_level"]) ## 数据集构成 ### 总问题数：500道 ### 领域分布： - **代数（Algebra）**：92道（18.4%） - **算术（Arithmetic）**：173道（34.6%） - **组合数学（Combinatorics）**：84道（16.8%） - **几何（Geometry）**：45道（9.0%） - **逻辑学（Logic）**：61道（12.2%） - **数论（Number Theory）**：45道（9.0%） ## 数据字段每个问题包含以下字段： - `id`：该样本的唯一标识符 - `original_id`：源数据集的原始标识符 - `family`：问题家族（例如："algebra_func_area"） - `difficulty_level`：源数据集给出的数值型难度层级 - `source_family`：源家族目录名称 - `source_level`：源难度层级名称 - `messages`：对话格式的问题陈述 - `ground_truth`：正确答案 - `dataset`：数据集标识符（"OMEGA_500_SAMPLE"） ## 引用说明若使用本数据集，请引用原始OMEGA研究成果： bibtex @article{sun2024omega, title = {OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization}, author = {Yiyou Sun and Shawn Hu and Georgia Zhou and Ken Zheng and Hannaneh Hajishirzi and Nouha Dziri and Dawn Song}, journal = {arXiv preprint arXiv:2506.18880}, year = {2024}, } ## 相关资源 - **完整问题家族数据集**：可访问[omega-problems](https://huggingface.co/datasets/allenai/omega-problems)获取完整数据集 - **探索式推理数据集**：可访问[omega-explorative](https://huggingface.co/datasets/allenai/omega-explorative)获取探索式推理挑战集 - **组合式推理数据集**：可访问[omega-compositional](https://huggingface.co/datasets/allenai/omega-compositional)获取组合式推理挑战集 - **转换式推理数据集**：可访问[omega-transformative](https://huggingface.co/datasets/allenai/omega-transformative)获取转换式推理挑战集 - **研究论文**：详细内容可查阅[论文](https://arxiv.org/pdf/2506.18880) - **代码仓库**：生成代码可访问[GitHub](https://github.com/sunblaze-ucb/math_ood)

提供机构：

maas

创建时间：

2025-07-16

5,000+

优质数据集

54 个

任务类型

进入经典数据集