jz666/gemma2-ultrafeedback-reward-chosen-4
收藏Hugging Face2025-12-19 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/jz666/gemma2-ultrafeedback-reward-chosen-4
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,包括prompt_id(提示ID)、prompt(提示内容)、all_generated_responses(所有生成的响应)、all_rm_scores(所有评分模型的分数)、chosen(被选中的响应内容及角色)和rejected(被拒绝的响应内容及角色)。数据集分为测试集(test)和四个训练集(train_q1、train_q2、train_q3、train_q4),分别包含不同数量的样本和字节大小。
The dataset includes multiple features such as prompt_id, prompt, all_generated_responses, all_rm_scores, chosen (content and role of the selected response), and rejected (content and role of the rejected response). It is divided into a test set (test) and four training sets (train_q1, train_q2, train_q3, train_q4), each with varying numbers of samples and byte sizes.
提供机构:
jz666



