nvidia/Nemotron-RL-coding-competitive_coding

Source: Hugging Face
Updated: 2025-12-15
Indexed: 2025-12-20
Download link:
https://hf-mirror.com/datasets/nvidia/Nemotron-RL-coding-competitive_coding
Description:
The Nemotron-RL-coding-competitive_coding dataset is a Python-only, reasoning-based, synthetic dataset. It contains competitive-programming-style problems and their unit test cases, collected from CodeContests (deepmind/code_contests) and Open-R1 (open-r1/codeforces). The dataset is released as part of NVIDIA NeMo Gym, a framework for building reinforcement learning environments to train large language models. NeMo Gym contains a growing collection of training environments and datasets to enable Reinforcement Learning from Verifiable Reward (RLVR). It is an open-source library within the NVIDIA NeMo framework, NVIDIA's GPU-accelerated, end-to-end training framework for large language models (LLMs), multimodal models, and speech models. This dataset is ready for commercial use.
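
To illustrate what a "verifiable reward" can mean for problems paired with unit tests, here is a minimal sketch. It is not NeMo Gym's API and does not reflect the dataset's actual schema: the function names and the (stdin, expected stdout) test format are assumptions made for illustration only. The idea is simply to run a candidate Python solution on each test input and return 1.0 only if every output matches.

```python
import subprocess
import sys
from typing import List, Optional, Tuple


def run_solution(source: str, stdin_text: str, timeout_s: float = 5.0) -> Optional[str]:
    """Run a candidate Python program on one input.

    Returns the program's stdout, or None if it crashes or times out.
    (Hypothetical helper; not part of NeMo Gym.)
    """
    try:
        proc = subprocess.run(
            [sys.executable, "-c", source],
            input=stdin_text,
            capture_output=True,
            text=True,
            timeout=timeout_s,
        )
    except subprocess.TimeoutExpired:
        return None
    if proc.returncode != 0:
        return None
    return proc.stdout


def verifiable_reward(source: str, tests: List[Tuple[str, str]]) -> float:
    """Return 1.0 if the program passes every (stdin, expected stdout) test, else 0.0.

    The test-case format is an assumption; outputs are compared token by token
    so that trailing whitespace and newlines do not matter.
    """
    for stdin_text, expected in tests:
        actual = run_solution(source, stdin_text)
        if actual is None or actual.split() != expected.split():
            return 0.0
    return 1.0
```

For example, a program that reads two integers and prints their sum would score 1.0 against the tests `[("1 2\n", "3\n"), ("10 -4\n", "6")]`, while a program that always prints 0 would score 0.0. In practice, model-generated code should run inside a sandbox rather than as a plain subprocess.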
Provider:
nvidia