Atipico1/webq_test
收藏Hugging Face2024-04-22 更新2024-06-22 收录
下载链接:
https://hf-mirror.com/datasets/Atipico1/webq_test
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: adversary
features:
- name: question
dtype: string
- name: answers
sequence: string
- name: ctxs
list:
- name: hasanswer
dtype: bool
- name: score
dtype: float64
- name: text
dtype: string
- name: title
dtype: string
- name: gpt_answer_sentence
dtype: string
- name: gpt_adv_sentence
sequence: string
- name: is_valid_adv_sentence
dtype: bool
- name: gpt_adv_passage
sequence: string
- name: is_valid_adv_passage
dtype: bool
splits:
- name: train
num_bytes: 16335230
num_examples: 2032
download_size: 9129827
dataset_size: 16335230
- config_name: adversary_v2
features:
- name: question
dtype: string
- name: answers
sequence: string
- name: ctxs
list:
- name: hasanswer
dtype: bool
- name: score
dtype: float64
- name: text
dtype: string
- name: title
dtype: string
- name: gpt_answer_sentence
dtype: string
- name: gpt_adv_sentence
sequence: string
- name: is_valid_adv_sentence
dtype: bool
- name: gpt_adv_passage
sequence: string
- name: is_valid_adv_passage
dtype: bool
splits:
- name: train
num_bytes: 16335074
num_examples: 2032
download_size: 9121125
dataset_size: 16335074
- config_name: adversary_v2-sent
features:
- name: question
dtype: string
- name: answers
sequence: string
- name: gpt_answer_sentence
dtype: string
- name: gpt_adv_sentence
sequence: string
- name: is_valid_adv_sentence
dtype: bool
- name: gpt_adv_passage
sequence: string
- name: is_valid_adv_passage
dtype: bool
- name: ctxs
list:
- name: hasanswer
dtype: bool
- name: score
dtype: float32
- name: text
dtype: string
splits:
- name: train
num_bytes: 4579727
num_examples: 2032
download_size: 2507827
dataset_size: 4579727
- config_name: conflict
features:
- name: question
dtype: string
- name: answers
sequence: string
- name: ctxs
list:
- name: hasanswer
dtype: bool
- name: score
dtype: float64
- name: text
dtype: string
- name: title
dtype: string
- name: gpt_answer_sentence
dtype: string
- name: entity_type
dtype: string
- name: similar_entity
dtype: string
- name: similar_entity_score
dtype: float32
- name: random_entity
dtype: string
- name: random_entity_score
dtype: float64
splits:
- name: train
num_bytes: 13758558
num_examples: 2032
download_size: 7878512
dataset_size: 13758558
- config_name: conflict_v1
features:
- name: question
dtype: string
- name: answers
sequence: string
- name: ctxs
list:
- name: hasanswer
dtype: bool
- name: score
dtype: float64
- name: text
dtype: string
- name: title
dtype: string
- name: gpt_answer_sentence
dtype: string
- name: entity_type
dtype: string
- name: similar_entity
dtype: string
- name: similar_entity_score
dtype: float32
- name: random_entity
dtype: string
- name: random_entity_score
dtype: float64
- name: gpt_conflict_sentence
sequence: string
- name: is_valid_conflict_sentence
dtype: bool
- name: gpt_conflict_passage
sequence: string
- name: is_valid_conflict_passage
dtype: bool
splits:
- name: train
num_bytes: 14671132
num_examples: 2032
download_size: 8415624
dataset_size: 14671132
- config_name: conflict_v1-sent
features:
- name: question
dtype: string
- name: answers
sequence: string
- name: gpt_answer_sentence
dtype: string
- name: entity_type
dtype: string
- name: similar_entity
dtype: string
- name: similar_entity_score
dtype: float32
- name: random_entity
dtype: string
- name: random_entity_score
dtype: float64
- name: gpt_conflict_sentence
sequence: string
- name: is_valid_conflict_sentence
dtype: bool
- name: gpt_conflict_passage
sequence: string
- name: is_valid_conflict_passage
dtype: bool
- name: ctxs
list:
- name: hasanswer
dtype: bool
- name: score
dtype: float32
- name: text
dtype: string
splits:
- name: train
num_bytes: 2915604
num_examples: 2032
download_size: 1801711
dataset_size: 2915604
- config_name: default
features:
- name: question
dtype: string
- name: answers
sequence: string
- name: ctxs
list:
- name: hasanswer
dtype: bool
- name: score
dtype: float64
- name: text
dtype: string
- name: title
dtype: string
splits:
- name: train
num_bytes: 13481387
num_examples: 2032
download_size: 7685748
dataset_size: 13481387
configs:
- config_name: adversary
data_files:
- split: train
path: adversary/train-*
- config_name: adversary_v2
data_files:
- split: train
path: adversary_v2/train-*
- config_name: adversary_v2-sent
data_files:
- split: train
path: adversary_v2-sent/train-*
- config_name: conflict
data_files:
- split: train
path: conflict/train-*
- config_name: conflict_v1
data_files:
- split: train
path: conflict_v1/train-*
- config_name: conflict_v1-sent
data_files:
- split: train
path: conflict_v1-sent/train-*
- config_name: default
data_files:
- split: train
path: data/train-*
---
提供机构:
Atipico1
原始信息汇总
数据集概述
数据集配置
配置名称:adversary
- 特征:
question: 字符串answers: 字符串序列ctxs: 列表hasanswer: 布尔值score: 浮点数 (float64)text: 字符串title: 字符串
gpt_answer_sentence: 字符串gpt_adv_sentence: 字符串序列is_valid_adv_sentence: 布尔值gpt_adv_passage: 字符串序列is_valid_adv_passage: 布尔值
- 分割:
train:- 字节数:16335230
- 样本数:2032
- 下载大小:9129827
- 数据集大小:16335230
配置名称:adversary_v2
- 特征:
question: 字符串answers: 字符串序列ctxs: 列表hasanswer: 布尔值score: 浮点数 (float64)text: 字符串title: 字符串
gpt_answer_sentence: 字符串gpt_adv_sentence: 字符串序列is_valid_adv_sentence: 布尔值gpt_adv_passage: 字符串序列is_valid_adv_passage: 布尔值
- 分割:
train:- 字节数:16335074
- 样本数:2032
- 下载大小:9121125
- 数据集大小:16335074
配置名称:adversary_v2-sent
- 特征:
question: 字符串answers: 字符串序列gpt_answer_sentence: 字符串gpt_adv_sentence: 字符串序列is_valid_adv_sentence: 布尔值gpt_adv_passage: 字符串序列is_valid_adv_passage: 布尔值ctxs: 列表hasanswer: 布尔值score: 浮点数 (float32)text: 字符串
- 分割:
train:- 字节数:4579727
- 样本数:2032
- 下载大小:2507827
- 数据集大小:4579727
配置名称:conflict
- 特征:
question: 字符串answers: 字符串序列ctxs: 列表hasanswer: 布尔值score: 浮点数 (float64)text: 字符串title: 字符串
gpt_answer_sentence: 字符串entity_type: 字符串similar_entity: 字符串similar_entity_score: 浮点数 (float32)random_entity: 字符串random_entity_score: 浮点数 (float64)
- 分割:
train:- 字节数:13758558
- 样本数:2032
- 下载大小:7878512
- 数据集大小:13758558
配置名称:conflict_v1
- 特征:
question: 字符串answers: 字符串序列ctxs: 列表hasanswer: 布尔值score: 浮点数 (float64)text: 字符串title: 字符串
gpt_answer_sentence: 字符串entity_type: 字符串similar_entity: 字符串similar_entity_score: 浮点数 (float32)random_entity: 字符串random_entity_score: 浮点数 (float64)gpt_conflict_sentence: 字符串序列is_valid_conflict_sentence: 布尔值gpt_conflict_passage: 字符串序列is_valid_conflict_passage: 布尔值
- 分割:
train:- 字节数:14671132
- 样本数:2032
- 下载大小:8415624
- 数据集大小:14671132
配置名称:conflict_v1-sent
- 特征:
question: 字符串answers: 字符串序列gpt_answer_sentence: 字符串entity_type: 字符串similar_entity: 字符串similar_entity_score: 浮点数 (float32)random_entity: 字符串random_entity_score: 浮点数 (float64)gpt_conflict_sentence: 字符串序列is_valid_conflict_sentence: 布尔值gpt_conflict_passage: 字符串序列is_valid_conflict_passage: 布尔值ctxs: 列表hasanswer: 布尔值score: 浮点数 (float32)text: 字符串
- 分割:
train:- 字节数:2915604
- 样本数:2032
- 下载大小:1801711
- 数据集大小:2915604
配置名称:default
- 特征:
question: 字符串answers: 字符串序列ctxs: 列表hasanswer: 布尔值score: 浮点数 (float64)text: 字符串title: 字符串
- 分割:
train:- 字节数:13481387
- 样本数:2032
- 下载大小:7685748
- 数据集大小:13481387



