nvidia/Nemotron-Cascade-RL-RLHF

Name: nvidia/Nemotron-Cascade-RL-RLHF
Creator: nvidia
Published: 2025-12-16 02:13:32
License: 暂无描述

Hugging Face2025-12-16 更新2025-12-20 收录

下载链接：

https://hf-mirror.com/datasets/nvidia/Nemotron-Cascade-RL-RLHF

下载链接

链接失效反馈

官方服务：

资源简介：

Nemotron-Cascade-RL-RLHF数据集专为人类反馈强化学习（RLHF）训练设计，包含提示和相关元数据，用于支持语言模型对齐的开发。该数据集已准备好用于商业用途。数据集包含45,882个样本，用于RLHF训练，包括提示、数据来源和类别信息。数据集来源于多个子数据集，如HelpSteer 2、HelpSteer 3和WildGuard。数据集格式为Parquet，结构包含文本和元数据，列包括提示、数据来源、索引、类别和分类标签。数据集创建于2025年12月15日，采用Creative Commons Attribution 4.0 International License (CC BY 4.0)许可。

The Nemotron-Cascade-RL-RLHF dataset is designed for Reinforcement Learning from Human Feedback (RLHF) training. It contains prompts and associated metadata to support the development of language model alignment. This dataset is ready for commercial use. The dataset contains 45,882 samples used for RLHF training, including prompts, data sources, and category information. It is a curated subset of datasets from HelpSteer 2, HelpSteer 3, and WildGuard. The dataset is in Parquet format, structured as text plus metadata, with columns including prompt, data_source, index, category, and cat. The dataset was created on Dec 15, 2025, and is governed by the Creative Commons Attribution 4.0 International License (CC BY 4.0).

提供机构：

nvidia

5,000+

优质数据集

54 个

任务类型

进入经典数据集