ktolnos/helpsteer3_goldSkywork-Reward-V2-Llama-3.1-8B-10k
收藏Hugging Face2025-10-22 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/ktolnos/helpsteer3_goldSkywork-Reward-V2-Llama-3.1-8B-10k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了用户偏好的强度、选择的选项和拒绝的选项(包括内容和角色)、选择的奖励、拒绝的奖励以及是否金标准与原始选择一致等信息。数据集分为训练集和测试集两部分。
The dataset includes user preference strength, chosen and rejected options (including content and role), reward for choosing, reward for rejecting, and whether the gold standard agrees with the original choice. The dataset is split into training and test sets.
提供机构:
ktolnos



