tyzhu/squad_qa_no_id_v5_full_recite_full_passage
Source: Hugging Face. Updated: 2023-11-23. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/tyzhu/squad_qa_no_id_v5_full_recite_full_passage
Resource description:
---
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: validation
    path: data/validation-*
dataset_info:
  features:
  - name: id
    dtype: string
  - name: title
    dtype: string
  - name: context
    dtype: string
  - name: question
    dtype: string
  - name: answers
    sequence:
    - name: text
      dtype: string
    - name: answer_start
      dtype: int32
  - name: answer
    dtype: string
  - name: context_id
    dtype: string
  - name: inputs
    dtype: string
  - name: targets
    dtype: string
  splits:
  - name: train
    num_bytes: 9247014
    num_examples: 5070
  - name: validation
    num_bytes: 580390
    num_examples: 300
  download_size: 1781909
  dataset_size: 9827404
---
# Dataset Card for "squad_qa_no_id_v5_full_recite_full_passage"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
This is a question answering dataset with a single default configuration. It is split into a training set (5,070 examples) and a validation set (300 examples), whose data files match the patterns data/train-* and data/validation-*. Each example has the fields id, title, context, question, answers (a sequence of text and answer_start entries), answer, context_id, inputs, and targets. The metadata also records the byte size of each split, the download size, and the total dataset size.
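A minimal sketch of loading both splits, assuming the Hugging Face `datasets` library is installed and the repository is reachable on the Hub. The helper name `load_splits` is hypothetical; the repository ID and split names are taken from the card's metadata.

```python
REPO_ID = "tyzhu/squad_qa_no_id_v5_full_recite_full_passage"


def load_splits(repo_id: str = REPO_ID):
    """Download the dataset and return its train and validation splits."""
    # Imported inside the helper so that defining it does not require the library.
    from datasets import load_dataset

    dataset = load_dataset(repo_id)  # DatasetDict keyed by split name
    return dataset["train"], dataset["validation"]
```

Calling `train, validation = load_splits()` should return two splits whose `num_rows` can be compared against the 5,070 and 300 examples listed in the metadata, and whose `features` attribute can be compared against the field list.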
Provider:
tyzhu
Summary of original information
Dataset overview
Dataset name
- squad_qa_no_id_v5_full_recite_full_passage
Dataset configuration
- Default configuration
  - Training data file path: data/train-*
  - Validation data file path: data/validation-*
Dataset features
- id: string
- title: string
- context: string
- question: string
- answers: sequence
  - text: string
  - answer_start: integer (int32)
- answer: string
- context_id: string
- inputs: string
- targets: string
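The `answers` feature pairs each answer `text` with an `answer_start` value. In the original SQuAD data this value is a character offset into `context`; whether this derived dataset preserves that alignment is an assumption worth verifying. The sketch below runs such a check on a hand-written, hypothetical record (not taken from the dataset), and `answers_align` is an illustrative helper name.

```python
def answers_align(record: dict) -> bool:
    """Return True if every answer text is found at its answer_start offset."""
    context = record["context"]
    answers = record["answers"]
    # A sequence of named fields is exposed as parallel lists.
    for text, start in zip(answers["text"], answers["answer_start"]):
        if context[start:start + len(text)] != text:
            return False
    return True


# Hypothetical record for illustration only; it carries just the fields the check uses.
example = {
    "context": "The tower is 330 metres tall.",
    "question": "How tall is the tower?",
    "answers": {"text": ["330 metres"], "answer_start": [13]},
}

print(answers_align(example))
```

The same function could be applied to each row of a loaded split to count misaligned answers before relying on the offsets.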
Dataset splits
- Training set
  - Bytes: 9247014
  - Examples: 5070
- Validation set
  - Bytes: 580390
  - Examples: 300
Dataset size
- Download size: 1781909 bytes
- Dataset size: 9827404 bytes
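The size figures above can be cross-checked with a little arithmetic. The sketch below uses only the numbers listed in the metadata; the variable names are illustrative. It confirms that the per-split byte counts add up to the reported dataset size, then derives the average example size and the ratio of download size to dataset size. The download is smaller than the dataset size, presumably because the stored files are compressed.

```python
# Figures copied from the card's metadata (all sizes in bytes).
splits = {
    "train": {"num_bytes": 9247014, "num_examples": 5070},
    "validation": {"num_bytes": 580390, "num_examples": 300},
}
download_size = 1781909
dataset_size = 9827404

# The reported dataset size equals the sum of the per-split byte counts.
total_bytes = sum(s["num_bytes"] for s in splits.values())
assert total_bytes == dataset_size

# Average size of one example in each split.
avg_bytes = {name: s["num_bytes"] / s["num_examples"] for name, s in splits.items()}

# Ratio of the download size to the reported dataset size.
download_ratio = download_size / dataset_size

for name, avg in avg_bytes.items():
    print(f"{name}: {avg:.0f} bytes per example")
print(f"download / dataset size: {download_ratio:.2f}")
```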



