dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_6_Skywork-Reward-Gemma-2-27B-v0.2

Name: dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_6_Skywork-Reward-Gemma-2-27B-v0.2
Creator: dogtooth
Published: 2024-12-17 23:26:22
License: 暂无描述

Hugging Face2024-12-17 更新2024-12-21 收录

下载链接：

https://hf-mirror.com/datasets/dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_6_Skywork-Reward-Gemma-2-27B-v0.2

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集是allenai/open_instruct项目中的一部分，专注于拒绝采样算法。它可能用于训练或评估模型在特定任务上的表现，特别是通过拒绝采样方法来优化模型的输出。数据集的具体内容未在README中详细描述，但根据配置参数和运行命令，可以推测它涉及使用Skywork/Skywork-Reward-Gemma-2-27B-v0.2模型进行多轮完成和评分。

This dataset is part of the allenai/open_instruct project, focusing on the rejection sampling algorithm. It is likely used for training or evaluating model performance on specific tasks, particularly by optimizing model outputs through rejection sampling methods. The specific content of the dataset is not detailed in the README, but based on the configuration parameters and running commands, it can be inferred that it involves using the Skywork/Skywork-Reward-Gemma-2-27B-v0.2 model for multiple completions and scoring.

提供机构：

dogtooth

5,000+

优质数据集

54 个

任务类型

进入经典数据集