teamcore/DPO_Pm3B_U0_beta0.25dpo_proEurus_RM_7bbt_noise_flip0.3g
收藏Hugging Face2025-10-22 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/teamcore/DPO_Pm3B_U0_beta0.25dpo_proEurus_RM_7bbt_noise_flip0.3g
下载链接
链接失效反馈官方服务:
资源简介:
这是一个用于评估模型表现的数据集,包含了对模型完成任务的多个维度的评分和反馈信息,如帮助性、诚实性、指导遵循性和真实性等。数据集包含一个split,共有100个示例。
This is a dataset for evaluating model performance, containing ratings and feedback on multiple dimensions of task completion, such as helpfulness, honesty, instruction following, and truthfulness. The dataset includes one split with a total of 100 examples.
提供机构:
teamcore



