five

nvidia/Nemotron-RL-math-stack_overflow

收藏
Hugging Face2025-12-12 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/nvidia/Nemotron-RL-math-stack_overflow
下载链接
链接失效反馈
官方服务:
资源简介:
Nemotron-RL-math-stack_overflow数据集包含从Stack Overflow论坛提取的数学问题及其解决方案。提取问题及其解决方案的方法类似于创建OpenMathReasoning数据集的方法。该数据集是NVIDIA NeMo Gym框架的一部分,用于训练大型语言模型的强化学习环境。数据集格式为纯文本,与NeMo Gym兼容,包含436307个(问题,答案)元组,总存储量为271.4 MiB。数据集适用于商业用途,使用CC-BY-SA 4.0许可。

The Nemotron-RL-math-stack_overflow dataset contains mathematical problems and solutions sourced from the Stack Overflow forums. The method of extracting problems and solutions from forum posts was similar to the one used to create the OpenMathReasoning dataset. This dataset is released as part of NVIDIA NeMo Gym, a framework for building reinforcement learning environments to train large language models. The dataset is in text-only format, compatible with NeMo Gym, and contains 436307 tuples of (question, answer) with a total data storage of 271.4 MiB. The dataset is ready for commercial use under the CC-BY-SA 4.0 license.
提供机构:
nvidia
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作