Nemotron-RL-math-advanced_calculations
收藏魔搭社区2025-12-04 更新2025-12-06 收录
下载链接:
https://modelscope.cn/datasets/nv-community/Nemotron-RL-math-advanced_calculations
下载链接
链接失效反馈官方服务:
资源简介:
## Dataset Description:
The Nemotron-RL-math-advanced_calculations is a dataset designed to test a model's ability to solve complex, multi-step math problems in a multi-step agentic environment. It involves counterintuitive calculations with varying levels of function composition.
This dataset is released as part of NVIDIA [NeMo Gym](https://github.com/NVIDIA-NeMo/Gym), a framework for building reinforcement learning environments to train large language models. NeMo Gym contains a growing collection of training environments and datasets to enable Reinforcement Learning from Verifiable Reward (RLVR).
NeMo Gym is an open-source library within the [NVIDIA NeMo framework](https://github.com/NVIDIA-NeMo/), NVIDIA's GPU accelerated, end-to-end training framework for large language models (LLMs), multi-modal models and speech models.
This dataset is part of the [Nemo Gym Collection](https://huggingface.co/collections/nvidia/nemo-gym).
This dataset is ready for commercial use.
## Dataset Owner(s):
NVIDIA Corporation
## Dataset Creation Date:
September 3rd 2025
## License/Terms of Use:
CC BY 4.0
## Intended Usage:
To be used with [NeMo-Gym](https://github.com/NVIDIA-NeMo/Gym) for post-training LLMs.
## Dataset Characterization
Data Collection Method<br>
* [Synthetic] <br>
Labeling Method<br>
* [Synthetic] <br>
## Dataset Format
Text Only, Compatible with [NeMo-Gym](https://github.com/NVIDIA-NeMo/Gym)
## Dataset Quantification
Record Count - 6K query-answer tuples.
## Reference(s):
[NeMo-Gym](https://github.com/NVIDIA-NeMo/Gym)
## Ethical Considerations:
NVIDIA believes Trustworthy AI is a shared responsibility and we have established policies and practices to enable development for a wide array of AI applications. When downloaded or used in accordance with our terms of service, developers should work with their internal model team to ensure this model meets requirements for the relevant industry and use case and addresses unforeseen product misuse.
Please report model quality, risk, security vulnerabilities or NVIDIA AI Concerns [here](https://www.nvidia.com/en-us/support/submit-security-vulnerability/).
数据集描述:
Nemotron-RL-math-advanced_calculations 数据集专为测试模型在多步AI智能体(AI Agent)环境中求解复杂多步数学问题的能力而设计,涵盖不同复杂度的函数复合反直觉计算任务。
该数据集作为NVIDIA旗下NeMo Gym(https://github.com/NVIDIA-NeMo/Gym)的一部分发布,NeMo Gym是用于构建强化学习环境以训练大语言模型(Large Language Model, LLM)的框架。NeMo Gym收录了日益丰富的训练环境与数据集,以支持可验证奖励强化学习(Reinforcement Learning from Verifiable Reward, RLVR)。
NeMo Gym 是NVIDIA NeMo框架(https://github.com/NVIDIA-NeMo/)下的开源库,该框架是NVIDIA推出的GPU加速型端到端训练框架,支持大语言模型(LLMs)、多模态模型与语音模型。
本数据集隶属于Nemo Gym 数据集合集(https://huggingface.co/collections/nvidia/nemo-gym)。
本数据集可用于商业场景。
数据集所有者:NVIDIA公司
数据集创建日期:2025年9月3日
许可/使用条款:CC BY 4.0
预期用途:需配合NeMo-Gym(https://github.com/NVIDIA-NeMo/Gym)用于大语言模型的后训练阶段。
数据集特征:
数据收集方式:[合成生成]
标注方式:[合成生成]
数据集格式:仅文本格式,兼容NeMo-Gym(https://github.com/NVIDIA-NeMo/Gym)
数据集量化统计:样本总量为6000条查询-答案元组。
参考文献:NeMo-Gym(https://github.com/NVIDIA-NeMo/Gym)
伦理考量:
NVIDIA认为可信人工智能是一项共同责任,我们已建立相关政策与实践规范,以支持各类人工智能应用的开发。开发者在按照服务条款下载或使用本数据集时,应与其内部模型团队协作,确保该模型符合相关行业与应用场景的要求,并防范产品被意外滥用。
若需反馈模型质量、风险、安全漏洞或NVIDIA人工智能相关问题,请访问此处(https://www.nvidia.com/en-us/support/submit-security-vulnerability/)提交。
提供机构:
maas
创建时间:
2025-11-15



