clembench-playpen/DPO_allneg_Aborted_best_models_FINAL
Source: Hugging Face. Updated: 2025-03-16. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/clembench-playpen/DPO_allneg_Aborted_best_models_FINAL
Description:
The dataset contains several game-related fields, such as game ID, version, experiment information, and episode, along with flags marking model success and failure. The chosen and rejected fields may hold the model's selected and rejected outputs in the game. The data is provided as a training split, suitable for training machine learning models. The README does not describe the specific game content, the model task, or the intended application.
Provider:
clembench-playpen
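
The chosen and rejected fields mentioned in the description suggest a preference-pair layout. The sketch below is one possible way to load the training split and reduce rows to those two fields. It assumes the Hugging Face `datasets` library is installed and that the columns are literally named `chosen` and `rejected`. The helper names, the `game_id` key, and the sample rows are hypothetical and serve only as illustration; the actual column types are not documented in the listing.

```python
DATASET_ID = "clembench-playpen/DPO_allneg_Aborted_best_models_FINAL"


def load_train_split():
    """Load the training split from the Hugging Face Hub (needs network access)."""
    # Imported lazily so the rest of this sketch runs without the library.
    from datasets import load_dataset
    return load_dataset(DATASET_ID, split="train")


def to_preference_pairs(rows):
    """Keep only the chosen/rejected fields of each row (hypothetical helper)."""
    pairs = []
    for row in rows:
        # Skip rows where either side of the pair is missing.
        if row.get("chosen") is None or row.get("rejected") is None:
            continue
        pairs.append({"chosen": row["chosen"], "rejected": row["rejected"]})
    return pairs


# Hypothetical rows for illustration only; real rows would come from
# load_train_split(), and "game_id" is an assumed column name.
sample_rows = [
    {"game_id": 0, "chosen": "reply A", "rejected": "reply B"},
    {"game_id": 1, "chosen": "reply C", "rejected": None},
]
pairs = to_preference_pairs(sample_rows)
print(len(pairs))
```

With real data, `to_preference_pairs(load_train_split())` would apply the same filter to every row of the training split. Inspecting a few rows first is advisable, since the listing does not say whether the fields hold plain strings or structured messages.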



