tyzhu/lmind_nq_train6000_eval6489_v1_doc_qa_random_permute
收藏Hugging Face2024-03-28 更新2024-06-11 收录
下载链接:
https://hf-mirror.com/datasets/tyzhu/lmind_nq_train6000_eval6489_v1_doc_qa_random_permute
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: answers
struct:
- name: answer_start
sequence: 'null'
- name: text
sequence: string
- name: inputs
dtype: string
- name: targets
dtype: string
splits:
- name: all_docs_eval
num_bytes: 7125701
num_examples: 10925
- name: validation
num_bytes: 752802
num_examples: 6489
- name: train_qa
num_bytes: 697367
num_examples: 6000
- name: train_ic_qa
num_bytes: 4540536
num_examples: 6000
- name: train
num_bytes: 29202939
num_examples: 49700
- name: eval_ic_qa
num_bytes: 4906186
num_examples: 6489
- name: eval_recite_qa
num_bytes: 4912675
num_examples: 6489
- name: all_docs
num_bytes: 7126313
num_examples: 10925
- name: eval_qa
num_bytes: 752802
num_examples: 6489
- name: train_recite_qa
num_bytes: 4546536
num_examples: 6000
- name: first_permute_docs
num_bytes: 37615961
num_examples: 57692
- name: random_permute_docs
num_bytes: 28505572
num_examples: 43700
download_size: 42973377
dataset_size: 130685390
---
# Dataset Card for "lmind_nq_train6000_eval6489_v1_doc_qa_random_permute"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
---
数据集信息:
特征字段:
- 名称:answers,为结构体类型,包含以下子字段:
- answer_start:可为空值的序列
- text:字符串类型的序列
- 名称:inputs,数据类型为字符串
- 名称:targets,数据类型为字符串
数据集划分:
- 名称:all_docs_eval,占用字节数:7125701,样本数量:10925
- 名称:validation(验证集),占用字节数:752802,样本数量:6489
- 名称:train_qa,占用字节数:697367,样本数量:6000
- 名称:train_ic_qa,占用字节数:4540536,样本数量:6000
- 名称:train(训练集),占用字节数:29202939,样本数量:49700
- 名称:eval_ic_qa,占用字节数:4906186,样本数量:6489
- 名称:eval_recite_qa,占用字节数:4912675,样本数量:6489
- 名称:all_docs,占用字节数:7126313,样本数量:10925
- 名称:eval_qa,占用字节数:752802,样本数量:6489
- 名称:train_recite_qa,占用字节数:4546536,样本数量:6000
- 名称:first_permute_docs,占用字节数:37615961,样本数量:57692
- 名称:random_permute_docs,占用字节数:28505572,样本数量:43700
下载大小:42973377字节
数据集总存储大小:130685390字节
---
# 「lmind_nq_train6000_eval6489_v1_doc_qa_random_permute」数据集卡片
[更多相关信息请参阅](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
tyzhu
原始信息汇总
数据集概述
数据集特征
- answers
- answer_start: 类型为null
- text: 类型为字符串
- inputs: 类型为字符串
- targets: 类型为字符串
数据集分割
- all_docs_eval
- num_bytes: 7125701
- num_examples: 10925
- validation
- num_bytes: 752802
- num_examples: 6489
- train_qa
- num_bytes: 697367
- num_examples: 6000
- train_ic_qa
- num_bytes: 4540536
- num_examples: 6000
- train
- num_bytes: 29202939
- num_examples: 49700
- eval_ic_qa
- num_bytes: 4906186
- num_examples: 6489
- eval_recite_qa
- num_bytes: 4912675
- num_examples: 6489
- all_docs
- num_bytes: 7126313
- num_examples: 10925
- eval_qa
- num_bytes: 752802
- num_examples: 6489
- train_recite_qa
- num_bytes: 4546536
- num_examples: 6000
- first_permute_docs
- num_bytes: 37615961
- num_examples: 57692
- random_permute_docs
- num_bytes: 28505572
- num_examples: 43700
数据集大小
- download_size: 42973377
- dataset_size: 130685390



