alvinming/frames-wrong-ans-exp-filter
收藏Hugging Face2025-12-15 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/alvinming/frames-wrong-ans-exp-filter
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含来自不同AI模型(如GPT和Gemini系列)对问题的回答,以及对这些回答的评分和推理过程。数据集特征包括运行标识、问题、真实答案、完整回答、推理过程、最终答案、评分、评分理由、引用URL、评委候选分析及最终裁决等。数据集分为多个子集,分别对应不同的模型版本。
This dataset contains responses from various AI models (such as GPT and Gemini series) to questions, along with grading and reasoning about those responses. The features include run identifier, question, ground truth, full response, reasoning, final answer, grade, grader reasoning, cited URLs, judge candidate analysis, and final verdict. The dataset is divided into several subsets corresponding to different model versions.
提供机构:
alvinming



