
introvoyz041/Nemotron-RL-math-advanced_calculations

Source: Hugging Face. Updated: 2025-12-12. Indexed: 2025-12-20.
Download link:
https://hf-mirror.com/datasets/introvoyz041/Nemotron-RL-math-advanced_calculations
Description:
Nemotron-RL-math-advanced_calculations is a dataset designed to test a model's ability to solve complex, multi-step math problems in a multi-step agentic environment. It involves counterintuitive calculations with varying levels of function composition. The dataset is released as part of NVIDIA NeMo Gym, a framework for building reinforcement learning environments to train large language models. NeMo Gym contains a growing collection of training environments and datasets to enable Reinforcement Learning from Verifiable Reward (RLVR). NeMo Gym is an open-source library within the NVIDIA NeMo framework, NVIDIA's GPU-accelerated, end-to-end training framework for large language models (LLMs), multi-modal models, and speech models. This dataset is ready for commercial use.
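As a rough illustration of what a "verifiable reward" can mean for a math task, the sketch below scores a model response by comparing the last number it contains with a known reference answer. This is not NeMo Gym's API and not the dataset's actual verifier. The function names `extract_final_number` and `verifiable_reward`, the tolerance, and the "last number is the answer" rule are all assumptions made for the example.

```python
import math
import re
from typing import Optional

# Hypothetical helper: treats the last numeric literal in the text as the answer.
NUMBER_PATTERN = re.compile(r"-?\d+(?:\.\d+)?(?:[eE][-+]?\d+)?")


def extract_final_number(response: str) -> Optional[float]:
    """Return the last number found in the response, or None if there is none."""
    matches = NUMBER_PATTERN.findall(response)
    return float(matches[-1]) if matches else None


def verifiable_reward(response: str, expected: float, rel_tol: float = 1e-6) -> float:
    """Binary reward: 1.0 if the extracted answer matches the reference, else 0.0."""
    value = extract_final_number(response)
    if value is None:
        return 0.0
    matched = math.isclose(value, expected, rel_tol=rel_tol, abs_tol=1e-9)
    return 1.0 if matched else 0.0


if __name__ == "__main__":
    # A composed calculation such as f(g(2)) with a known reference value.
    print(verifiable_reward("g(2) = 5, so f(g(2)) = 17", expected=17.0))
    print(verifiable_reward("I believe the result is 16.5", expected=17.0))
```

Because the reward is computed by a deterministic check rather than by a learned judge, the same response always receives the same score, which is the property the RLVR setting relies on.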
Provided by:
introvoyz041