clembench-playpen/DPO_1neg_Aborted_same_family_model_old_LA
收藏Hugging Face2025-03-23 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/clembench-playpen/DPO_1neg_Aborted_same_family_model_old_LA
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含游戏相关数据的训练集,其中包括游戏名称、游戏ID、版本、实验名称、回合信息、模型成功与否的标记、提示信息以及被选择和拒绝的内容信息(包括内容、角色和类型)。数据集分为训练集,共有3735个示例,总大小为14,468,753字节。
This is a training set containing game-related data, including game name, game ID, version, experiment name, episode information, model success or failure marks, prompt information, and chosen and rejected content information (including content, role, and type). The dataset is split into a training set with a total of 3,735 examples and a total size of 14,468,753 bytes.
提供机构:
clembench-playpen



