RLHFlow/self_rewarding_turn2_example

Name: RLHFlow/self_rewarding_turn2_example
Creator: RLHFlow
Published: 2025-03-02 22:28:27
License: 暂无描述

Hugging Face2025-03-02 更新2025-04-12 收录

下载链接：

https://hf-mirror.com/datasets/RLHFlow/self_rewarding_turn2_example

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含了答案、正确答案、是否给予第一次奖励以及提示信息（包括内容和角色）等字段。训练集共有44968个样本，数据集大小为409228915字节。

The dataset includes fields for answers, correct answers, whether the first reward is given, and prompt messages (including content and role). The training set contains 44968 samples, with a total dataset size of 409228915 bytes.

提供机构：

RLHFlow

5,000+

优质数据集

54 个

任务类型

进入经典数据集