dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_4_Skywork-Reward-Gemma-2-27B-v0.2
收藏Hugging Face2024-12-16 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/dogtooth/uf_Llama-3.1-Tulu-3-8B-SFT_4_Skywork-Reward-Gemma-2-27B-v0.2
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是allenai/open_instruct项目中的一个子集,专注于拒绝采样(Rejection Sampling)方法。它涉及使用Skywork/Skywork-Reward-Gemma-2-27B-v0.2模型进行判断和评分,配置中包括输入文件路径、模型路径、保存文件路径等详细信息。数据集的具体内容未明确描述,但推断其可能用于模型训练和评估中的拒绝采样过程。
This dataset is a subset of the allenai/open_instruct project, focusing on the Rejection Sampling method. It involves using the Skywork/Skywork-Reward-Gemma-2-27B-v0.2 model for judgment and scoring, with configurations including input file paths, model paths, save file paths, and other detailed information. The specific content of the dataset is not explicitly described, but it is inferred to be used in the rejection sampling process of model training and evaluation.
提供机构:
dogtooth



