mothnaZl/self_rewarding_sft_prompt_turn3_Qwen2.5-7B-Instruct_correct

Name: mothnaZl/self_rewarding_sft_prompt_turn3_Qwen2.5-7B-Instruct_correct
Creator: mothnaZl
Published: 2025-04-06 00:43:33
License: 暂无描述

Hugging Face2025-04-06 更新2025-04-12 收录

下载链接：

https://hf-mirror.com/datasets/mothnaZl/self_rewarding_sft_prompt_turn3_Qwen2.5-7B-Instruct_correct

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含了一个字符串类型的gt字段，一个包含内容和角色信息的prompt_messages列表，以及一个布尔类型的first_reward字段。数据集分为训练集，共有141888个示例，总大小为327576716字节。

The dataset includes a string type gt field, a prompt_messages list containing content and role information, and a boolean first_reward field. The dataset is split into a training set with a total of 141888 examples and a total size of 327576716 bytes.

提供机构：

mothnaZl

5,000+

优质数据集

54 个

任务类型

进入经典数据集