sanderland/arena-expert-5k
Source: Hugging Face. Updated 2025-12-11; indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/sanderland/arena-expert-5k
Description:
This dataset contains expert votes collected in the text-only category. Each row represents a single vote judging two models (`model_a` and `model_b`) on a user conversation, together with the full conversation history. Key fields include: `id` (unique feedback ID of each vote/row), `evaluation_order` (evaluation order of the current vote), `winner` (battle result, one of `model_a`, `model_b`, `tie`, or `both_bad`), `conversation_a` / `conversation_b` (full conversation of the current evaluation order), `full_conversation` (the entire conversation, including context prompts and answers from all previous evaluation orders), and `occupational_tags` (the occupational categories tagged to each conversation).
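As a rough illustration of how the `winner` field might be summarized, the sketch below tallies outcomes across vote rows. It assumes each row is available as a Python dict keyed by the field names above; the helper `tally_winners` and the sample rows are hypothetical and are not taken from the dataset.

```python
# Outcome labels listed in the description for the `winner` field.
VALID_WINNERS = ("model_a", "model_b", "tie", "both_bad")


def tally_winners(rows):
    """Count how often each `winner` outcome occurs across vote rows.

    `rows` is assumed to be an iterable of dicts with a `winner` key.
    """
    counts = {label: 0 for label in VALID_WINNERS}
    for row in rows:
        winner = row["winner"]
        if winner not in counts:
            # Fail loudly on labels the description does not mention.
            raise ValueError(f"unexpected winner value: {winner!r}")
        counts[winner] += 1
    return counts


# Hypothetical rows for illustration only; real rows carry more fields.
sample_rows = [
    {"id": "vote-1", "evaluation_order": 1, "winner": "model_a"},
    {"id": "vote-2", "evaluation_order": 1, "winner": "tie"},
    {"id": "vote-3", "evaluation_order": 2, "winner": "model_a"},
    {"id": "vote-4", "evaluation_order": 1, "winner": "both_bad"},
]

print(tally_winners(sample_rows))
```

Rejecting unknown labels rather than silently counting them is a design choice here, so that a schema mismatch surfaces early.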
Provider:
sanderland



