clembench-playpen/DPO_dialogue_1neg_old
收藏Hugging Face2025-05-07 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/clembench-playpen/DPO_dialogue_1neg_old
下载链接
链接失效反馈官方服务:
资源简介:
这个数据集包含多个字段,如游戏名称、游戏ID、版本、实验类型、回合、模型成功与否的标记、提示信息、玩家信息以及玩家选择和拒绝的内容和角色。数据集分为训练集,其大小为25013760字节,共有6696个示例。数据集的总大小为25013760字节,下载大小为974446字节。
The dataset contains multiple fields such as game name, game ID, version, experiment type, episode, model success or failure markers, prompt information, player information, and the content and role of the players choice and rejection. The dataset is split into a training set, which is 25013760 bytes in size and contains 6696 examples. The total size of the dataset is 25013760 bytes, and the download size is 974446 bytes.
提供机构:
clembench-playpen



