dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_8_Skywork-Reward-Gemma-2-27B-v0.2

Name: dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_8_Skywork-Reward-Gemma-2-27B-v0.2
Creator: dogtooth
Published: 2024-12-15 21:52:20
License: not specified

Source: Hugging Face. Updated: 2024-12-15. Indexed: 2024-12-21.

Download link:

https://hf-mirror.com/datasets/dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_8_Skywork-Reward-Gemma-2-27B-v0.2


Description:

This dataset is a Rejection Sampling Dataset from the allenai/open_instruct project, intended for model training and evaluation with the rejection sampling algorithm. Its configuration information shows that completions were scored with the Skywork/Skywork-Reward-Gemma-2-27B-v0.2 reward model, and it records parameters such as the maximum forward batch size, the number of completions, and the number of GPUs. The run command included with the dataset shows how it is used for model training and evaluation.
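As a rough illustration of the rejection sampling idea mentioned above, the following Python sketch scores several completions for one prompt and keeps the highest- and lowest-scoring ones as a preference pair. It is a minimal sketch of the general technique, not the open_instruct implementation: the function names `rejection_sample` and `toy_score`, the output field names, and the length-based scoring rule are all hypothetical. In practice the scoring function would call a reward model such as the one named in the description.

```python
from typing import Callable, Dict, List


def rejection_sample(
    prompt: str,
    completions: List[str],
    score_fn: Callable[[str, str], float],
) -> Dict[str, object]:
    """Score every completion and keep the best and worst as a pair.

    Hypothetical helper for illustration; field names are assumptions.
    """
    if len(completions) < 2:
        raise ValueError("need at least two completions to form a pair")

    # One reward score per completion, in the same order as the inputs.
    scores = [score_fn(prompt, c) for c in completions]

    # Indices of the highest- and lowest-scoring completions.
    best = max(range(len(completions)), key=scores.__getitem__)
    worst = min(range(len(completions)), key=scores.__getitem__)

    return {
        "prompt": prompt,
        "chosen": completions[best],
        "rejected": completions[worst],
        "chosen_score": scores[best],
        "rejected_score": scores[worst],
    }


def toy_score(prompt: str, completion: str) -> float:
    # Hypothetical stand-in for a reward model: longer answers score higher.
    return float(len(completion))
```

Calling `rejection_sample("q", ["a", "abc", "ab"], toy_score)` would select `"abc"` as the chosen completion and `"a"` as the rejected one under this toy scoring rule.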

Provider:

dogtooth