Xnhyacinth/NQ-Image
收藏Hugging Face2023-11-22 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Xnhyacinth/NQ-Image
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
dataset_info:
- config_name: ctxs1
features:
- name: id
dtype: int64
- name: answers
sequence: string
- name: question
dtype: string
- name: compressed_prompt
struct:
- name: compressed_prompt
dtype: string
- name: compressed_tokens
dtype: int64
- name: origin_tokens
dtype: int64
- name: ratio
dtype: string
- name: saving
dtype: string
- name: ctxs
list:
- name: id
dtype: string
- name: text
dtype: string
- name: title
dtype: string
splits:
- name: train
num_bytes: 5212377086
num_examples: 79168
- name: eval
num_bytes: 576466670
num_examples: 8757
- name: test
num_bytes: 238448436
num_examples: 3610
download_size: 3334114023
dataset_size: 6027292192
- config_name: ctxs100
features:
- name: question
dtype: string
- name: compressed_prompt
struct:
- name: compressed_prompt
dtype: string
- name: compressed_tokens
dtype: int64
- name: origin_tokens
dtype: int64
- name: ratio
dtype: string
- name: saving
dtype: string
- name: answers
sequence: string
- name: id
dtype: int64
- name: ctxs
list:
- name: id
dtype: string
- name: text
dtype: string
- name: title
dtype: string
splits:
- name: train
num_bytes: 5316136683
num_examples: 79168
- name: eval
num_bytes: 587931406
num_examples: 8757
- name: test
num_bytes: 243224578
num_examples: 3610
download_size: 3413758169
dataset_size: 6147292667
- config_name: ctxs5
features:
- name: id
dtype: int64
- name: answers
sequence: string
- name: question
dtype: string
- name: compressed_prompt
struct:
- name: compressed_prompt
dtype: string
- name: compressed_tokens
dtype: int64
- name: origin_tokens
dtype: int64
- name: ratio
dtype: string
- name: saving
dtype: string
- name: ctxs
list:
- name: id
dtype: string
- name: score
dtype: float64
- name: text
dtype: string
- name: title
dtype: string
splits:
- name: train
num_bytes: 5379479786
num_examples: 79168
- name: eval
num_bytes: 594986589
num_examples: 8757
- name: test
num_bytes: 246104192
num_examples: 3610
download_size: 3408308518
dataset_size: 6220570567
configs:
- config_name: ctxs1
data_files:
- split: train
path: ctxs1/train-*
- split: eval
path: ctxs1/eval-*
- split: test
path: ctxs1/test-*
- config_name: ctxs100
data_files:
- split: train
path: ctxs100/train-*
- split: eval
path: ctxs100/eval-*
- split: test
path: ctxs100/test-*
- config_name: ctxs5
data_files:
- split: train
path: ctxs5/train-*
- split: eval
path: ctxs5/eval-*
- split: test
path: ctxs5/test-*
---
提供机构:
Xnhyacinth
原始信息汇总
数据集概述
数据集配置
-
config_name: ctxs1
- 特征:
id: 类型int64answers: 序列类型stringquestion: 类型stringcompressed_prompt: 结构类型compressed_prompt: 类型stringcompressed_tokens: 类型int64origin_tokens: 类型int64ratio: 类型stringsaving: 类型string
ctxs: 列表类型id: 类型stringtext: 类型stringtitle: 类型string
- 分割:
train: 字节数5212377086, 样本数79168eval: 字节数576466670, 样本数8757test: 字节数238448436, 样本数3610
- 下载大小:
3334114023 - 数据集大小:
6027292192
- 特征:
-
config_name: ctxs100
- 特征:
question: 类型stringcompressed_prompt: 结构类型compressed_prompt: 类型stringcompressed_tokens: 类型int64origin_tokens: 类型int64ratio: 类型stringsaving: 类型string
answers: 序列类型stringid: 类型int64ctxs: 列表类型id: 类型stringtext: 类型stringtitle: 类型string
- 分割:
train: 字节数5316136683, 样本数79168eval: 字节数587931406, 样本数8757test: 字节数243224578, 样本数3610
- 下载大小:
3413758169 - 数据集大小:
6147292667
- 特征:
-
config_name: ctxs5
- 特征:
id: 类型int64answers: 序列类型stringquestion: 类型stringcompressed_prompt: 结构类型compressed_prompt: 类型stringcompressed_tokens: 类型int64origin_tokens: 类型int64ratio: 类型stringsaving: 类型string
ctxs: 列表类型id: 类型stringscore: 类型float64text: 类型stringtitle: 类型string
- 分割:
train: 字节数5379479786, 样本数79168eval: 字节数594986589, 样本数8757test: 字节数246104192, 样本数3610
- 下载大小:
3408308518 - 数据集大小:
6220570567
- 特征:
数据文件路径
-
config_name: ctxs1
train:ctxs1/train-*eval:ctxs1/eval-*test:ctxs1/test-*
-
config_name: ctxs100
train:ctxs100/train-*eval:ctxs100/eval-*test:ctxs100/test-*
-
config_name: ctxs5
train:ctxs5/train-*eval:ctxs5/eval-*test:ctxs5/test-*



