dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_5_Skywork-Reward-Gemma-2-27B-v0.2

Name: dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_5_Skywork-Reward-Gemma-2-27B-v0.2
Creator: dogtooth
Published: 2024-12-18 18:26:20
License: 暂无描述

Hugging Face2024-12-18 更新2024-12-21 收录

下载链接：

https://hf-mirror.com/datasets/dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_5_Skywork-Reward-Gemma-2-27B-v0.2

下载链接

链接失效反馈

官方服务：

资源简介：

allenai/open_instruct数据集是一个用于拒绝采样的数据集，包含prompt、response和score三个特征。数据集的分割部分包含一个名为train_weighted_sft的分割，该分割包含122,270个样本。数据集的生成使用了rejection_sampling.py脚本，脚本的输入文件、模型路径、保存文件名等参数在README中有详细描述。

The dataset includes three features: prompt, response, and score, representing the input prompt, the generated response, and the score respectively. The dataset is divided into a training set with 122270 samples.

提供机构：

dogtooth

5,000+

优质数据集

54 个

任务类型

进入经典数据集