alvinming/browsecomp-wrong-ans-exp-filter
Source: Hugging Face | Updated: 2025-12-15 | Indexed: 2025-12-20
Download link:
https://hf-mirror.com/datasets/alvinming/browsecomp-wrong-ans-exp-filter
Resource description:
---
dataset_info:
  features:
  - name: run
    dtype: string
  - name: question
    dtype: string
  - name: ground_truth
    dtype: string
  - name: full_response
    dtype: string
  - name: reasoning
    dtype: string
  - name: final_answer
    dtype: string
  - name: grade
    dtype: string
  - name: grader_reasoning
    dtype: string
  - name: cited_urls
    dtype: string
  - name: judge_candidate_1_analysis
    dtype: string
  - name: judge_candidate_2_analysis
    dtype: string
  - name: judge_final_verdict
    dtype: string
  splits:
  - name: gpt_5
    num_bytes: 2535811
    num_examples: 467
  - name: gpt_5_mini
    num_bytes: 2007348
    num_examples: 358
  - name: gemini_2_5_flash
    num_bytes: 6851134
    num_examples: 628
  - name: gemini_2_5_pro
    num_bytes: 9887836
    num_examples: 1174
  download_size: 11210001
  dataset_size: 21282129
configs:
- config_name: default
  data_files:
  - split: gpt_5
    path: data/gpt_5-*
  - split: gpt_5_mini
    path: data/gpt_5_mini-*
  - split: gemini_2_5_flash
    path: data/gemini_2_5_flash-*
  - split: gemini_2_5_pro
    path: data/gemini_2_5_pro-*
---
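The metadata above names four splits under a single `default` config. A minimal sketch of how one split might be loaded with the Hugging Face `datasets` library follows; the helper names `data_glob` and `load_split` are hypothetical and introduced only for illustration, and the repository id is taken from the page title. The download step needs network access and the third-party `datasets` package, so it is wrapped in a function rather than run at import time.

```python
# Repository id as it appears in the page title and download URL.
REPO_ID = "alvinming/browsecomp-wrong-ans-exp-filter"

# Split names exactly as declared in the card metadata.
SPLITS = ("gpt_5", "gpt_5_mini", "gemini_2_5_flash", "gemini_2_5_pro")


def data_glob(split: str) -> str:
    """Return the data-file pattern the default config maps to `split`.

    Hypothetical helper: it only mirrors the `data_files` entries above.
    """
    if split not in SPLITS:
        raise ValueError(f"unknown split: {split!r}")
    return f"data/{split}-*"


def load_split(split: str):
    """Fetch one split from the Hub (requires network and `datasets`).

    Hypothetical helper wrapping `datasets.load_dataset`.
    """
    from datasets import load_dataset  # third-party, imported lazily

    data_glob(split)  # raises ValueError on an unknown split name
    return load_dataset(REPO_ID, split=split)
```

Calling `load_split("gpt_5")` would be expected to return a dataset whose columns match the twelve string features listed in the metadata, though that has not been verified here.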
Dataset information:
Features:
- Field: run, type: string
- Field: question, type: string
- Field: ground_truth (reference answer), type: string
- Field: full_response, type: string
- Field: reasoning, type: string
- Field: final_answer, type: string
- Field: grade, type: string
- Field: grader_reasoning, type: string
- Field: cited_urls, type: string
- Field: judge_candidate_1_analysis, type: string
- Field: judge_candidate_2_analysis, type: string
- Field: judge_final_verdict, type: string
Splits:
- Name: gpt_5 (GPT-5), bytes: 2535811, examples: 467
- Name: gpt_5_mini (GPT-5 Mini), bytes: 2007348, examples: 358
- Name: gemini_2_5_flash (Gemini 2.5 Flash), bytes: 6851134, examples: 628
- Name: gemini_2_5_pro (Gemini 2.5 Pro), bytes: 9887836, examples: 1174
Download size: 11210001 bytes
Total dataset size: 21282129 bytes
Configs:
- Config name: default
Data files:
- Split: gpt_5 (GPT-5), path: data/gpt_5-*
- Split: gpt_5_mini (GPT-5 Mini), path: data/gpt_5_mini-*
- Split: gemini_2_5_flash (Gemini 2.5 Flash), path: data/gemini_2_5_flash-*
- Split: gemini_2_5_pro (Gemini 2.5 Pro), path: data/gemini_2_5_pro-*
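The per-split figures can be cross-checked against the reported totals. The short sketch below sums the split byte counts and example counts copied from the listing; the variable names are illustrative only, and the numbers are taken verbatim from the metadata rather than measured from the files.

```python
# (num_bytes, num_examples) per split, copied from the card metadata.
SPLIT_STATS = {
    "gpt_5": (2535811, 467),
    "gpt_5_mini": (2007348, 358),
    "gemini_2_5_flash": (6851134, 628),
    "gemini_2_5_pro": (9887836, 1174),
}
DATASET_SIZE = 21282129   # reported dataset_size, in bytes
DOWNLOAD_SIZE = 11210001  # reported download_size, in bytes

total_bytes = sum(num_bytes for num_bytes, _ in SPLIT_STATS.values())
total_examples = sum(num_examples for _, num_examples in SPLIT_STATS.values())

# Fraction of the in-memory size that the downloaded files occupy.
download_ratio = DOWNLOAD_SIZE / DATASET_SIZE

print(total_bytes == DATASET_SIZE)  # True: the splits sum to dataset_size
print(total_examples)               # 2627
print(round(download_ratio, 2))
```

The split sizes add up exactly to the reported `dataset_size`, and the four splits together hold 2627 examples.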
Provider:
alvinming



