nvidia/Nemotron-RL-math-stack_overflow
收藏Hugging Face2025-12-12 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/nvidia/Nemotron-RL-math-stack_overflow
下载链接
链接失效反馈官方服务:
资源简介:
Nemotron-RL-math-stack_overflow数据集包含从Stack Overflow论坛提取的数学问题及其解决方案。提取问题及其解决方案的方法类似于创建OpenMathReasoning数据集的方法。该数据集是NVIDIA NeMo Gym框架的一部分,用于训练大型语言模型的强化学习环境。数据集格式为纯文本,与NeMo Gym兼容,包含436307个(问题,答案)元组,总存储量为271.4 MiB。数据集适用于商业用途,使用CC-BY-SA 4.0许可。
The Nemotron-RL-math-stack_overflow dataset contains mathematical problems and solutions sourced from the Stack Overflow forums. The method of extracting problems and solutions from forum posts was similar to the one used to create the OpenMathReasoning dataset. This dataset is released as part of NVIDIA NeMo Gym, a framework for building reinforcement learning environments to train large language models. The dataset is in text-only format, compatible with NeMo Gym, and contains 436307 tuples of (question, answer) with a total data storage of 271.4 MiB. The dataset is ready for commercial use under the CC-BY-SA 4.0 license.
提供机构:
nvidia



