Atipico1/nq-test
Source: Hugging Face. Updated 2024-04-18; indexed 2024-06-11.
Download link:
https://hf-mirror.com/datasets/Atipico1/nq-test
Resource description:
---
dataset_info:
- config_name: adversary
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  - name: gpt_answer_sentence
    dtype: string
  - name: gpt_adv_sentence
    sequence: string
  - name: is_valid_adv_sentence
    dtype: bool
  - name: gpt_adv_passage
    sequence: string
  - name: is_valid_adv_passage
    dtype: bool
  splits:
  - name: train
    num_bytes: 28520709
    num_examples: 3610
  download_size: 16013125
  dataset_size: 28520709
- config_name: adversary-sent
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: gpt_answer_sentence
    dtype: string
  - name: gpt_adv_sentence
    sequence: string
  - name: is_valid_adv_sentence
    dtype: bool
  - name: gpt_adv_passage
    sequence: string
  - name: is_valid_adv_passage
    dtype: bool
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float32
    - name: text
      dtype: string
  splits:
  - name: train
    num_bytes: 7841766
    num_examples: 3610
  download_size: 4333156
  dataset_size: 7841766
- config_name: adversary_v2
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  - name: gpt_answer_sentence
    dtype: string
  - name: gpt_adv_sentence
    sequence: string
  - name: is_valid_adv_sentence
    dtype: bool
  - name: gpt_adv_passage
    sequence: string
  - name: is_valid_adv_passage
    dtype: bool
  splits:
  - name: train
    num_bytes: 28520588
    num_examples: 3610
  download_size: 16014456
  dataset_size: 28520588
- config_name: adversary_v2-sent
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: gpt_answer_sentence
    dtype: string
  - name: gpt_adv_sentence
    sequence: string
  - name: is_valid_adv_sentence
    dtype: bool
  - name: gpt_adv_passage
    sequence: string
  - name: is_valid_adv_passage
    dtype: bool
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float32
    - name: text
      dtype: string
  splits:
  - name: train
    num_bytes: 7946175
    num_examples: 3610
  download_size: 4474343
  dataset_size: 7946175
- config_name: adversary_v3
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  - name: gpt_answer_sentence
    dtype: string
  - name: gpt_adv_sentence
    sequence: string
  - name: is_valid_adv_sentence
    dtype: bool
  - name: gpt_adv_passage
    sequence: string
  - name: is_valid_adv_passage
    dtype: bool
  splits:
  - name: train
    num_bytes: 28520588
    num_examples: 3610
  download_size: 16014456
  dataset_size: 28520588
- config_name: conflict
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  - name: gpt_answer_sentence
    dtype: string
  - name: entity_type
    dtype: string
  - name: similar_entity
    dtype: string
  - name: similar_entity_score
    dtype: float32
  - name: random_entity
    dtype: string
  - name: random_entity_score
    dtype: float64
  splits:
  - name: train
    num_bytes: 24192330
    num_examples: 3610
  download_size: 13890009
  dataset_size: 24192330
- config_name: conflict_v1
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  - name: gpt_answer_sentence
    dtype: string
  - name: entity_type
    dtype: string
  - name: similar_entity
    dtype: string
  - name: similar_entity_score
    dtype: float32
  - name: random_entity
    dtype: string
  - name: random_entity_score
    dtype: float64
  - name: gpt_conflict_sentence
    sequence: string
  - name: is_valid_conflict_sentence
    dtype: bool
  - name: gpt_conflict_passage
    sequence: string
  - name: is_valid_conflict_passage
    dtype: bool
  splits:
  - name: train
    num_bytes: 25835423
    num_examples: 3610
  download_size: 14872958
  dataset_size: 25835423
- config_name: conflict_v1-sent
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: gpt_answer_sentence
    dtype: string
  - name: entity_type
    dtype: string
  - name: similar_entity
    dtype: string
  - name: similar_entity_score
    dtype: float32
  - name: random_entity
    dtype: string
  - name: random_entity_score
    dtype: float64
  - name: gpt_conflict_sentence
    sequence: string
  - name: is_valid_conflict_sentence
    dtype: bool
  - name: gpt_conflict_passage
    sequence: string
  - name: is_valid_conflict_passage
    dtype: bool
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float32
    - name: text
      dtype: string
  splits:
  - name: train
    num_bytes: 5261010
    num_examples: 3610
  download_size: 3332793
  dataset_size: 5261010
- config_name: default
  features:
  - name: question
    dtype: string
  - name: answers
    sequence: string
  - name: ctxs
    list:
    - name: hasanswer
      dtype: bool
    - name: score
      dtype: float64
    - name: text
      dtype: string
    - name: title
      dtype: string
  splits:
  - name: train
    num_bytes: 23673906
    num_examples: 3610
  download_size: 13529716
  dataset_size: 23673906
configs:
- config_name: adversary
  data_files:
  - split: train
    path: adversary/train-*
- config_name: adversary-sent
  data_files:
  - split: train
    path: adversary-sent/train-*
- config_name: adversary_v2
  data_files:
  - split: train
    path: adversary_v2/train-*
- config_name: adversary_v2-sent
  data_files:
  - split: train
    path: adversary_v2-sent/train-*
- config_name: adversary_v3
  data_files:
  - split: train
    path: adversary_v3/train-*
- config_name: conflict
  data_files:
  - split: train
    path: conflict/train-*
- config_name: conflict_v1
  data_files:
  - split: train
    path: conflict_v1/train-*
- config_name: conflict_v1-sent
  data_files:
  - split: train
    path: conflict_v1-sent/train-*
- config_name: default
  data_files:
  - split: train
    path: data/train-*
---
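The metadata above lists nine configurations, each with a single `train` split. A minimal sketch of loading one of them with the Hugging Face `datasets` library follows. It assumes the library is installed and that the repository `Atipico1/nq-test` is reachable from the default Hub endpoint; the helper name `load_config` is hypothetical and not part of the dataset.

```python
# Config names copied from the `configs:` section of the dataset card.
CONFIG_NAMES = [
    "adversary", "adversary-sent", "adversary_v2", "adversary_v2-sent",
    "adversary_v3", "conflict", "conflict_v1", "conflict_v1-sent", "default",
]


def load_config(name: str = "default"):
    """Load the `train` split of one configuration (hypothetical helper)."""
    if name not in CONFIG_NAMES:
        raise ValueError(f"unknown config {name!r}; expected one of {CONFIG_NAMES}")
    # Imported lazily so the name check works even without the library installed.
    from datasets import load_dataset

    # Second positional argument is the configuration name.
    return load_dataset("Atipico1/nq-test", name, split="train")
```

Calling `load_config("adversary")` should return a dataset with 3610 rows if the card is accurate. Because the download link on this page points at a mirror, setting the `HF_ENDPOINT` environment variable before importing the library may be needed in some networks; that is an assumption about the reader's environment, not something the card states.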
Provided by:
Atipico1
Summary of original information
Dataset overview
Dataset configuration: adversary
- Features:
  - question: string
  - answers: sequence of strings
  - ctxs: list with the following sub-features:
    - hasanswer: boolean
    - score: 64-bit float
    - text: string
    - title: string
  - gpt_answer_sentence: string
  - gpt_adv_sentence: sequence of strings
  - is_valid_adv_sentence: boolean
  - gpt_adv_passage: sequence of strings
  - is_valid_adv_passage: boolean
- Splits:
  - train: 28520709 bytes, 3610 examples
- Download size: 16013125 bytes
- Dataset size: 28520709 bytes
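To make the `adversary` schema concrete, the sketch below builds one hand-written record with the listed fields and filters it. All values are invented for illustration. The card does not document field semantics, so two readings here are assumptions drawn from the field names: that a higher `score` means a more relevant context, and that each `is_valid_*` flag says whether the matching generated text is usable.

```python
from typing import Any

# Hypothetical record shaped like the `adversary` schema; every value is invented.
example: dict[str, Any] = {
    "question": "who wrote the example novel",
    "answers": ["Jane Doe"],
    "ctxs": [
        {"hasanswer": False, "score": 79.5, "text": "An unrelated passage.", "title": "Other"},
        {"hasanswer": True, "score": 81.2, "text": "Jane Doe wrote the example novel.", "title": "Example novel"},
    ],
    "gpt_answer_sentence": "Jane Doe wrote the example novel.",
    "gpt_adv_sentence": ["John Roe wrote the example novel."],
    "is_valid_adv_sentence": True,
    "gpt_adv_passage": ["John Roe, a novelist, wrote the example novel."],
    "is_valid_adv_passage": False,
}


def answer_bearing_contexts(record: dict[str, Any]) -> list[dict[str, Any]]:
    """Return contexts flagged `hasanswer`, highest `score` first (assumed ordering)."""
    hits = [c for c in record["ctxs"] if c["hasanswer"]]
    return sorted(hits, key=lambda c: c["score"], reverse=True)


def usable_adversarial_text(record: dict[str, Any]) -> list[str]:
    """Collect generated sentences and passages whose validity flag is set."""
    out: list[str] = []
    if record["is_valid_adv_sentence"]:
        out.extend(record["gpt_adv_sentence"])
    if record["is_valid_adv_passage"]:
        out.extend(record["gpt_adv_passage"])
    return out
```

On the invented record, `answer_bearing_contexts` keeps only the context titled "Example novel", and `usable_adversarial_text` returns the single adversarial sentence because the passage flag is false.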
Dataset configuration: adversary-sent
- Features:
  - question: string
  - answers: sequence of strings
  - gpt_answer_sentence: string
  - gpt_adv_sentence: sequence of strings
  - is_valid_adv_sentence: boolean
  - gpt_adv_passage: sequence of strings
  - is_valid_adv_passage: boolean
  - ctxs: list with the following sub-features:
    - hasanswer: boolean
    - score: 32-bit float
    - text: string
- Splits:
  - train: 7841766 bytes, 3610 examples
- Download size: 4333156 bytes
- Dataset size: 7841766 bytes
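The `-sent` configurations differ from their base configurations in the `ctxs` sub-features. The sketch below diffs the two field listings as given in the card; the helper `diff_fields` is a hypothetical name. The card does not explain what `-sent` stands for, so the code compares schemas only.

```python
# `ctxs` sub-features as listed in the card for two configurations.
CTXS_FIELDS = {
    "adversary": {"hasanswer": "bool", "score": "float64", "text": "string", "title": "string"},
    "adversary-sent": {"hasanswer": "bool", "score": "float32", "text": "string"},
}


def diff_fields(a: dict[str, str], b: dict[str, str]) -> tuple[list[str], list[str]]:
    """Return (fields present only in `a`, shared fields whose dtype differs)."""
    only_a = sorted(set(a) - set(b))
    changed = sorted(k for k in set(a) & set(b) if a[k] != b[k])
    return only_a, changed


only_base, changed = diff_fields(CTXS_FIELDS["adversary"], CTXS_FIELDS["adversary-sent"])
print(only_base)  # ['title']
print(changed)    # ['score']
```

Code that reads `ctxs[i]["title"]` will therefore work on `adversary` but not on `adversary-sent`, and `score` values may differ slightly in precision between the two.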
Dataset configuration: adversary_v2
- Features:
  - question: string
  - answers: sequence of strings
  - ctxs: list with the following sub-features:
    - hasanswer: boolean
    - score: 64-bit float
    - text: string
    - title: string
  - gpt_answer_sentence: string
  - gpt_adv_sentence: sequence of strings
  - is_valid_adv_sentence: boolean
  - gpt_adv_passage: sequence of strings
  - is_valid_adv_passage: boolean
- Splits:
  - train: 28520588 bytes, 3610 examples
- Download size: 16014456 bytes
- Dataset size: 28520588 bytes
Dataset configuration: adversary_v2-sent
- Features:
  - question: string
  - answers: sequence of strings
  - gpt_answer_sentence: string
  - gpt_adv_sentence: sequence of strings
  - is_valid_adv_sentence: boolean
  - gpt_adv_passage: sequence of strings
  - is_valid_adv_passage: boolean
  - ctxs: list with the following sub-features:
    - hasanswer: boolean
    - score: 32-bit float
    - text: string
- Splits:
  - train: 7946175 bytes, 3610 examples
- Download size: 4474343 bytes
- Dataset size: 7946175 bytes
Dataset configuration: adversary_v3
- Features:
  - question: string
  - answers: sequence of strings
  - ctxs: list with the following sub-features:
    - hasanswer: boolean
    - score: 64-bit float
    - text: string
    - title: string
  - gpt_answer_sentence: string
  - gpt_adv_sentence: sequence of strings
  - is_valid_adv_sentence: boolean
  - gpt_adv_passage: sequence of strings
  - is_valid_adv_passage: boolean
- Splits:
  - train: 28520588 bytes, 3610 examples
- Download size: 16014456 bytes
- Dataset size: 28520588 bytes
Dataset configuration: conflict
- Features:
  - question: string
  - answers: sequence of strings
  - ctxs: list with the following sub-features:
    - hasanswer: boolean
    - score: 64-bit float
    - text: string
    - title: string
  - gpt_answer_sentence: string
  - entity_type: string
  - similar_entity: string
  - similar_entity_score: 32-bit float
  - random_entity: string
  - random_entity_score: 64-bit float
- Splits:
  - train: 24192330 bytes, 3610 examples
- Download size: 13890009 bytes
- Dataset size: 24192330 bytes
Dataset configuration: conflict_v1
- Features:
  - question: string
  - answers: sequence of strings
  - ctxs: list with the following sub-features:
    - hasanswer: boolean
    - score: 64-bit float
    - text: string
    - title: string
  - gpt_answer_sentence: string
  - entity_type: string
  - similar_entity: string
  - similar_entity_score: 32-bit float
  - random_entity: string
  - random_entity_score: 64-bit float
  - gpt_conflict_sentence: sequence of strings
  - is_valid_conflict_sentence: boolean
  - gpt_conflict_passage: sequence of strings
  - is_valid_conflict_passage: boolean
- Splits:
  - train: 25835423 bytes, 3610 examples
- Download size: 14872958 bytes
- Dataset size: 25835423 bytes
Dataset configuration: conflict_v1-sent
- Features:
  - question: string
  - answers: sequence of strings
  - gpt_answer_sentence: string
  - entity_type: string
  - similar_entity: string
  - similar_entity_score: 32-bit float
  - random_entity: string
  - random_entity_score: 64-bit float
  - gpt_conflict_sentence: sequence of strings
  - is_valid_conflict_sentence: boolean
  - gpt_conflict_passage: sequence of strings
  - is_valid_conflict_passage: boolean
  - ctxs: list with the following sub-features:
    - hasanswer: boolean
    - score: 32-bit float
    - text: string
- Splits:
  - train: 5261010 bytes, 3610 examples
- Download size: 3332793 bytes
- Dataset size: 5261010 bytes
Dataset configuration: default
- Features:
  - question: string
  - answers: sequence of strings
  - ctxs: list with the following sub-features:
    - hasanswer: boolean
    - score: 64-bit float
    - text: string
    - title: string
- Splits:
  - train: 23673906 bytes, 3610 examples
- Download size: 13529716 bytes
- Dataset size: 23673906 bytes
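The byte counts above allow a quick comparison across configurations. The sketch below derives average bytes per example and the download-to-dataset size ratio from the figures in the card; the function names are hypothetical, and reading the ratio as a compression ratio is an assumption.

```python
# (num_bytes, download_size, num_examples) copied from the card for each configuration.
SIZES = {
    "adversary": (28520709, 16013125, 3610),
    "adversary-sent": (7841766, 4333156, 3610),
    "adversary_v2": (28520588, 16014456, 3610),
    "adversary_v2-sent": (7946175, 4474343, 3610),
    "adversary_v3": (28520588, 16014456, 3610),
    "conflict": (24192330, 13890009, 3610),
    "conflict_v1": (25835423, 14872958, 3610),
    "conflict_v1-sent": (5261010, 3332793, 3610),
    "default": (23673906, 13529716, 3610),
}


def bytes_per_example(config: str) -> float:
    """Average in-memory size of one example, in bytes."""
    num_bytes, _, num_examples = SIZES[config]
    return num_bytes / num_examples


def download_ratio(config: str) -> float:
    """Download size divided by dataset size (smaller means better compressed)."""
    num_bytes, download_size, _ = SIZES[config]
    return download_size / num_bytes


for name in SIZES:
    print(f"{name:18s} {bytes_per_example(name):9.1f} B/example  ratio {download_ratio(name):.2f}")
```

One detail visible in these figures: `adversary_v2` and `adversary_v3` report identical byte counts. That may mean the two configurations hold the same data, but the card does not confirm it.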



