mothnaZl/self_rewarding_sft_prompt_turn3_Qwen2.5-7B-Instruct_wrong

Name: mothnaZl/self_rewarding_sft_prompt_turn3_Qwen2.5-7B-Instruct_wrong
Creator: mothnaZl
Published: 2025-04-06 00:43:22
License: 暂无描述

Hugging Face2025-04-06 更新2025-04-12 收录

下载链接：

https://hf-mirror.com/datasets/mothnaZl/self_rewarding_sft_prompt_turn3_Qwen2.5-7B-Instruct_wrong

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含了三个字段：gt（字符串），prompt_messages（包含内容和角色的列表），first_reward（布尔值）。它有一个训练集，包含1111个示例，总大小为5223576字节。数据集的下载大小为1712891字节。

The dataset consists of three fields: gt (string), prompt_messages (a list containing content and role), and first_reward (boolean). It has a training set with 1111 examples, totaling 5223576 bytes in size. The download size of the dataset is 1712891 bytes.

提供机构：

mothnaZl

5,000+

优质数据集

54 个

任务类型

进入经典数据集