dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_2_Skywork-Reward-Gemma-2-27B-v0.2
Source: Hugging Face | Updated: 2024-12-16 | Indexed: 2024-12-21
Download link:
https://hf-mirror.com/datasets/dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_2_Skywork-Reward-Gemma-2-27B-v0.2
Description:
This dataset was produced by rejection sampling, in which model outputs are scored and filtered. According to the configuration, scoring uses the Skywork/Skywork-Reward-Gemma-2-27B-v0.2 reward model, and reference completions are included in the rejection sampling. The run command documents how the dataset was generated, including the input file, output file, batch size, number of GPUs, and other parameters.
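To illustrate the general idea of rejection sampling with a scoring model, here is a minimal sketch. It is not the pipeline used to build this dataset: the function names `rejection_sample` and `toy_score` are hypothetical, and `toy_score` is a stand-in that merely counts words, where a real setup would call a reward model such as the one named above. Treating the reference completion as one more candidate is also an assumption made for illustration.

```python
from typing import Callable, List, Tuple


def rejection_sample(
    completions: List[str],
    score_fn: Callable[[str], float],
) -> Tuple[str, str, List[float]]:
    """Score every candidate and return (chosen, rejected, scores).

    The highest-scoring candidate is kept as "chosen" and the
    lowest-scoring one as "rejected". Ties go to the earliest candidate.
    """
    if len(completions) < 2:
        raise ValueError("need at least two candidates to form a pair")
    scores = [score_fn(c) for c in completions]
    best = max(range(len(scores)), key=scores.__getitem__)
    worst = min(range(len(scores)), key=scores.__getitem__)
    return completions[best], completions[worst], scores


def toy_score(text: str) -> float:
    # Hypothetical stand-in for a reward model: more words, higher score.
    return float(len(text.split()))


# Sampled model outputs for one prompt, plus a reference completion
# that is scored alongside them (an assumption for this sketch).
sampled = ["Paris.", "The capital of France is Paris."]
reference = "Paris is the capital of France."

chosen, rejected, scores = rejection_sample(sampled + [reference], toy_score)
print(chosen)
print(rejected)
print(scores)
```

Swapping `toy_score` for a function that queries an actual reward model would leave the selection logic unchanged, since `rejection_sample` only depends on a callable that maps text to a float.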
Provider:
dogtooth



