clembench-playpen/DPO_2neg_Aborted_same_family_model_FINAL
收藏Hugging Face2025-03-16 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/clembench-playpen/DPO_2neg_Aborted_same_family_model_FINAL
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含游戏相关数据的训练集,其中包括游戏ID、版本、实验名称、回合信息,以及模型在游戏中的成功和失败标记。每个示例还包括了选中的(chosen)和拒绝的(rejected)内容、角色和类型信息。
This is a training dataset containing game-related data, including game ID, version, experiment name, episode information, and model success and failure marks in the game. Each example also includes chosen and rejected content, role, and type information.
提供机构:
clembench-playpen



