kaiwenw/dec9_sp1_repeat_5_pref_jdpo_all_reject_first
收藏Hugging Face2024-12-10 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/kaiwenw/dec9_sp1_repeat_5_pref_jdpo_all_reject_first
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含用于训练和验证的示例,每个示例包括一个提示(prompt)、一个被选择的选项(chosen)、一个被拒绝的选项(rejected)、被选择选项的偏好(chosen_pref)、被拒绝选项的偏好(rejected_pref)以及分割后缀(split_suffix)。数据集分为训练集和验证集,训练集包含24458个示例,验证集包含2280个示例。这些字段可能用于某种形式的偏好或选择任务。
This dataset contains examples for training and validation, each including a prompt, a chosen option, a rejected option, the preference for the chosen option, the preference for the rejected option, and a split suffix. The dataset is divided into a training set with 24,458 examples and a validation set with 2,280 examples. These fields may be used for some form of preference or selection task.
提供机构:
kaiwenw



