MoeReward/combined_rlhf_dataset_grpo_nq_main

Name: MoeReward/combined_rlhf_dataset_grpo_nq_main
Creator: MoeReward
Published: 2025-04-01 22:59:26
License: No description available

Source: Hugging Face (updated 2025-04-01, indexed 2025-04-12)

Download link:

https://hf-mirror.com/datasets/MoeReward/combined_rlhf_dataset_grpo_nq_main

Description:

The dataset has two features, answer and prompt, both of string type. It consists of a single training split with 3999 examples, reported as occupying 2335636.885933651 bytes (roughly 2.3 MB). The download size is 652557 bytes (roughly 650 KB). From these fields it can be inferred that this is a text dataset, possibly used for training text generation or text understanding models.
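The structure described above can be checked with a short script. This is a minimal sketch, assuming the repository is publicly readable through the Hugging Face `datasets` library and that the split is named `train`; the helper names `is_valid_record` and `load_train_split` are hypothetical and introduced only for illustration.

```python
from typing import Any, Mapping

# Repository ID taken from the listing; the feature names and row count
# are the values the listing reports, not verified facts.
REPO_ID = "MoeReward/combined_rlhf_dataset_grpo_nq_main"
EXPECTED_FEATURES = ("prompt", "answer")
EXPECTED_TRAIN_ROWS = 3999


def is_valid_record(record: Mapping[str, Any]) -> bool:
    """Return True if the record has string-valued prompt and answer fields."""
    return all(isinstance(record.get(name), str) for name in EXPECTED_FEATURES)


def load_train_split():
    """Load the train split from the Hugging Face Hub (requires network access).

    Assumption: the dataset is public and exposes a split called "train".
    """
    # Imported lazily so that is_valid_record works without the library installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split="train")
```

Calling `load_train_split()` and comparing `len(...)` of the result with `EXPECTED_TRAIN_ROWS`, then running `is_valid_record` over each row, would confirm whether the hosted data still matches the figures in this listing.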

Provider:

MoeReward
