Nemotron-RL-math-OpenMathReasoning

Name: Nemotron-RL-math-OpenMathReasoning
Creator: maas
Published: 2025-12-12 17:33:17
License: No description provided

ModelScope Community · Updated 2025-12-12 · Indexed 2025-11-22

Download link:

https://modelscope.cn/datasets/nv-community/Nemotron-RL-math-OpenMathReasoning

Resource description:

## Dataset Description:

The Nemotron-RL-math-OpenMathReasoning dataset contains mathematical problems and solutions sourced from the AoPS forums. These problems and solutions were previously released in the OpenMathReasoning dataset. In the present dataset, they are formatted for use in NeMo-Gym. The method of extracting problems and solutions from forum posts is described in this paper. Only problems for which an answer was extracted are included in the present dataset.

This dataset is released as part of NVIDIA [NeMo Gym](https://github.com/NVIDIA-NeMo/Gym), a framework for building reinforcement learning environments to train large language models. NeMo Gym contains a growing collection of training environments and datasets to enable Reinforcement Learning from Verifiable Reward (RLVR).

NeMo Gym is an open-source library within the [NVIDIA NeMo framework](https://github.com/NVIDIA-NeMo/), NVIDIA's GPU-accelerated, end-to-end training framework for large language models (LLMs), multi-modal models, and speech models.

This dataset is part of the [NeMo Gym Collection](https://huggingface.co/collections/nvidia/nemo-gym).

This dataset is ready for commercial use.

## Dataset Owner(s):

NVIDIA Corporation

## Dataset Creation Date:

August 20, 2025

## License/Terms of Use:

CC BY 4.0

## Intended Usage:

To be used with [NeMo Gym](https://github.com/NVIDIA-NeMo/Gym) for post-training LLMs.

## Dataset Characterization

Data Collection Method
* [Hybrid: Human, Automated, Synthetic]

Labeling Method
* [Synthetic]

## Dataset Format

Text Only, Compatible with [NeMo Gym](https://github.com/NVIDIA-NeMo/Gym)

## Dataset Quantification

Record Count: 112867 tuples of (question, answer)

Total Data Storage: 67.7 MiB (1 MiB = 1024^2 bytes = 1048576 bytes)

## Reference(s):

[NeMo Gym](https://github.com/NVIDIA-NeMo/Gym)

## Ethical Considerations:

NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.

Please report model quality, risk, security vulnerabilities or NVIDIA AI Concerns [here](https://www.nvidia.com/en-us/support/submit-security-vulnerability/).
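The storage figures under Dataset Quantification can be sanity-checked with a few lines of arithmetic. The sketch below converts the stated 67.7 MiB to bytes and derives an average size per record. It assumes the storage figure covers exactly the 112867 records and nothing else, which the card does not confirm; the variable names are illustrative only.

```python
# Figures taken from the dataset card.
MIB = 1024 ** 2            # bytes per MiB (1048576)
RECORD_COUNT = 112_867     # (question, answer) tuples
TOTAL_MIB = 67.7           # stated total data storage

# Assumption: the stated storage is made up entirely of these records.
total_bytes = TOTAL_MIB * MIB
avg_bytes_per_record = total_bytes / RECORD_COUNT

print(f"total: {total_bytes:,.0f} bytes")
print(f"average per record: {avg_bytes_per_record:.0f} bytes")
```

Under that assumption, each record averages a little over 600 bytes including any formatting overhead, which is consistent with short problem statements paired with extracted final answers.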


Provider:

maas

Created:

2025-11-15
