MetaMathQA-R1

Name: MetaMathQA-R1
Creator: maas
Published: 2025-12-05 16:29:54
License: 暂无描述

魔搭社区2025-12-05 更新2025-04-12 收录

下载链接：

https://modelscope.cn/datasets/oumi-ai/MetaMathQA-R1

下载链接

链接失效反馈

官方服务：

资源简介：

[![oumi logo](https://oumi.ai/logo_lockup_black.svg)](https://github.com/oumi-ai/oumi) [![Made with Oumi](https://badgen.net/badge/Made%20with/Oumi/%23085CFF?icon=https%3A%2F%2Foumi.ai%2Flogo_dark.svg)](https://github.com/oumi-ai/oumi) [![Documentation](https://img.shields.io/badge/Documentation-oumi-blue.svg)](https://oumi.ai/docs/en/latest/index.html) [![Blog](https://img.shields.io/badge/Blog-oumi-blue.svg)](https://oumi.ai/blog) [![Discord](https://img.shields.io/discord/1286348126797430814?label=Discord)](https://discord.gg/oumi) # oumi-ai/MetaMathQA-R1 **MetaMathQA-R1** is a text dataset designed to train Conversational Language Models with **DeepSeek-R1 level reasoning**. Prompts were augmented from [GSM8K](https://huggingface.co/datasets/openai/gsm8k) and [MATH](https://github.com/hendrycks/math) training sets with responses directly from **[DeepSeek-R1](https://huggingface.co/deepseek-ai/DeepSeek-R1)**. **MetaMathQA-R1** was used to train **[MiniMath-R1-1.5B](https://huggingface.co/oumi-ai/MiniMath-R1-1.5B)**, which achieves **44.4% accuracy on MMLU-Pro-Math**, the **highest of any model with <=1.5B parameters**. - **Curated by:** [Oumi AI](https://oumi.ai/) using Oumi inference on [Parasail](https://www.parasail.io/) - **Language(s) (NLP):** English - **License:** [Apache 2.0](https://opensource.org/license/apache-2-0) - **Demo:** [Notebook](https://github.com/oumi-ai/oumi/blob/307436bd98706cb9ce7b0bbf31204770af2b7c8c/notebooks/Oumi%20-%20MiniMath-R1-1.5B.ipynb) ## Uses  Use this dataset for supervised fine-tuning of LLMs for improved performance in math by enabling inference-time reasoning. Can also be incorporated into a training mixture for creating an R1-like model. ## Out-of-Scope Use  This dataset is not well suited for producing generalized chat models on its own due to being entirely math queries. ## Dataset Structure  ``` { # Unique conversation identifier, tied back to MetaMathQA samples "conversation_id": str, # The user turn/prompt "prompt": str, # The assistant (DeepSeek R1) response # Includes the thought trace which is wrapped in <think> and </think> tags "response": str, # Data formatted to user + assistant turns in chat format # Example: [{'role': 'user', 'content': ...}, {'role': 'assistant', 'content': ...}] "messages": list[dict[str, str]], # Metadata for sample "metadata": dict[str, ...], } ``` ## Dataset Creation ### Curation Rationale  To enable the community to develop a fully-open state-of-the-art Foundational Language Model, we've produced and released this dataset to serve as part of the foundation of reasoning data for the model. It was produced using the Oumi’s inference capabilities on Parasail. ### Source Data  Queries were sourced from [MetaMathQA](https://huggingface.co/datasets/meta-math/MetaMathQA) which is data augmented from the training sets of GSM8K and MATH. #### Data Collection and Processing  * Responses were collected via Oumi's batch inference support for [Parasail](https://parasail.io/). * Samples which could not be parsed were discarded (<100). * All other samples include metadata indicating if they are complete or not (which was determined by whether or not a `</think>` token is present) #### Personal and Sensitive Information  Data is not known or likely to contain any personal, sensitive, or private information. ## Bias, Risks, and Limitations  1. The source prompts are from [MetaMathQA](https://huggingface.co/datasets/meta-math/MetaMathQA) `query` column and may reflect any biases in their data augmentation process. 2. The responses produced will likely be reflective of any biases or limitations produced by DeepSeek-R1. ## Citation  **BibTeX:** ``` @misc{metaMathQaR12025, author = {Jeremiah Greer}, title = {MetaMathQA-R1 Dataset}, month = {February}, year = {2025}, url = {https://huggingface.co/datasets/oumi-ai/MetaMathQA-R1} } @software{oumi2025, author = {Oumi Community}, title = {Oumi: an Open, End-to-end Platform for Building Large Foundation Models}, month = {January}, year = {2025}, url = {https://github.com/oumi-ai/oumi} } ```

[![oumi logo](https://oumi.ai/logo_lockup_black.svg)](https://github.com/oumi-ai/oumi) [![Made with Oumi](https://badgen.net/badge/Made%20with/Oumi/%23085CFF?icon=https%3A%2F%2Foumi.ai%2Flogo_dark.svg)](https://github.com/oumi-ai/oumi) [![Documentation](https://img.shields.io/badge/Documentation-oumi-blue.svg)](https://oumi.ai/docs/en/latest/index.html) [![Blog](https://img.shields.io/badge/Blog-oumi-blue.svg)](https://oumi.ai/blog) [![Discord](https://img.shields.io/discord/1286348126797430814?label=Discord)](https://discord.gg/oumi) # oumi-ai/MetaMathQA-R1 **MetaMathQA-R1** 是一款专为训练具备**DeepSeek-R1 级推理能力**的对话语言模型设计的文本数据集。其提示词源自[GSM8K](https://huggingface.co/datasets/openai/gsm8k)与[MATH](https://github.com/hendrycks/math)训练集的增强版数据，响应内容则直接来自**[DeepSeek-R1](https://huggingface.co/deepseek-ai/DeepSeek-R1)**。**MetaMathQA-R1** 曾用于训练**[MiniMath-R1-1.5B](https://huggingface.co/oumi-ai/MiniMath-R1-1.5B)**，该模型在**MMLU-Pro-Math**上实现了44.4%的准确率，为参数量≤15亿的模型中的最高水平。 - **整理方：** [Oumi AI](https://oumi.ai/)，依托Parasail平台的Oumi推理服务完成制作 - **NLP所用语言：** 英语 - **许可协议：** [Apache 2.0](https://opensource.org/license/apache-2-0) - **演示示例：** [Notebook](https://github.com/oumi-ai/oumi/blob/307436bd98706cb9ce7b0bbf31204770af2b7c8c/notebooks/Oumi%20-%20MiniMath-R1-1.5B.ipynb) ## 使用场景  本数据集适用于对大语言模型进行监督微调，以提升其数学推理性能，支持推理阶段的思维链生成。也可被纳入训练混合数据集，用于构建类似R1的推理模型。 ## 不适用场景  由于本数据集仅包含数学查询类内容，单独使用本数据集无法训练得到通用对话模型。 ## 数据集结构  { # 唯一对话标识符，关联至MetaMathQA样本 "conversation_id": str, # 用户轮次的提示词 "prompt": str, # 助手（DeepSeek R1）的响应内容 # 包含包裹在<think>与</think>标签内的思维链痕迹 "response": str, # 按照用户+助手轮次格式化的对话数据 # 示例：[{'role': 'user', 'content': ...}, {'role': 'assistant', 'content': ...}] "messages": list[dict[str, str]], # 样本元数据 "metadata": dict[str, ...], } ## 数据集构建 ### 整理初衷  为助力社区开发完全开源的顶尖基础大语言模型，我们制作并发布本数据集，作为模型推理训练数据的核心组成部分。本数据集依托Parasail平台的Oumi推理能力完成制作。 ### 源数据  查询问题源自[MetaMathQA](https://huggingface.co/datasets/meta-math/MetaMathQA)，而该数据集本身是基于GSM8K与MATH训练集增强得到的。 #### 数据收集与处理流程  * 响应内容通过Oumi在Parasail平台的批量推理服务获取。 * 无法正常解析的样本已被剔除（数量少于100）。 * 其余所有样本均包含元数据，用于标记样本是否完整（判断依据为是否存在`</think>`标记）。 #### 个人与敏感信息  本数据集未包含任何已知或疑似的个人、敏感或私有信息。 ## 偏差、风险与局限性  1. 源提示词取自[MetaMathQA](https://huggingface.co/datasets/meta-math/MetaMathQA)的`query`字段，可能继承其数据增强过程中引入的各类偏差。 2. 生成的响应大概率会反映DeepSeek-R1本身存在的偏差与局限性。 ## 引用信息  **BibTeX格式：** @misc{metaMathQaR12025, author = {Jeremiah Greer}, title = {MetaMathQA-R1 数据集}, month = {February}, year = {2025}, url = {https://huggingface.co/datasets/oumi-ai/MetaMathQA-R1} } @software{oumi2025, author = {Oumi Community}, title = {Oumi：开源端到端大基础模型构建平台}, month = {January}, year = {2025}, url = {https://github.com/oumi-ai/oumi} }

提供机构：

maas

创建时间：

2025-04-09

5,000+

优质数据集

54 个

任务类型

进入经典数据集