teamcore/DPO_Pm3B_U0_beta0.25sigmoidEurus_RM_7bbt_noise_flip_paper0.3g
收藏Hugging Face2025-10-23 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/teamcore/DPO_Pm3B_U0_beta0.25sigmoidEurus_RM_7bbt_noise_flip_paper0.3g
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为test_run_rebuttal_noisefixed,包含源文本(source)、指令(instruction)、模型(models)等多个字段。在completions字段中,有关于注释(annotations)、批评(critique)、自定义系统提示(custom_system_prompt)等信息。数据集还包含正确答案(correct_answers)、错误答案(incorrect_answers)、prompt等字段。此外,还提供了模型的评分、选择和拒绝的评分等信息。数据集分为default一个部分,共有1000个示例。
The dataset named test_run_rebuttal_noisefixed includes fields such as source text (source), instructions (instruction), models (models), etc. In the completions field, there are information about annotations, critiques, custom system prompts, etc. The dataset also contains fields like correct answers (correct_answers), incorrect answers (incorrect_answers), prompt, etc. Moreover, it provides information on model ratings, scores for chosen and rejected responses, etc. The dataset is divided into a single part called default, with a total of 1000 examples.
提供机构:
teamcore



