tyzhu/squad_qa_title_v5_full_recite_ans_sent
收藏Hugging Face2023-11-23 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/tyzhu/squad_qa_title_v5_full_recite_ans_sent
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validation
path: data/validation-*
dataset_info:
features:
- name: id
dtype: string
- name: title
dtype: string
- name: context
dtype: string
- name: question
dtype: string
- name: answers
sequence:
- name: text
dtype: string
- name: answer_start
dtype: int32
- name: answer
dtype: string
- name: context_id
dtype: string
- name: inputs
dtype: string
- name: targets
dtype: string
splits:
- name: train
num_bytes: 7956437
num_examples: 5070
- name: validation
num_bytes: 413353
num_examples: 300
download_size: 0
dataset_size: 8369790
---
# Dataset Card for "squad_qa_title_v5_full_recite_ans_sent"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
The dataset named squad_qa_title_v5_full_recite_ans_sent includes multiple configurations and features. It is divided into training and validation sets, each containing multiple files. The dataset features include id, title, context, question, answers (containing text and answer_start), answer, context_id, inputs, and targets. The training set contains 5070 samples, and the validation set contains 300 samples. The total size of the dataset is 8369790 bytes.
提供机构:
tyzhu
原始信息汇总
数据集概述
配置信息
- 默认配置 (
default)- 数据文件路径:
- 训练集 (
train):data/train-* - 验证集 (
validation):data/validation-*
- 训练集 (
- 数据文件路径:
数据集信息
-
特征字段:
id: 字符串类型title: 字符串类型context: 字符串类型question: 字符串类型answers: 序列类型text: 字符串类型answer_start: 整数类型 (int32)
answer: 字符串类型context_id: 字符串类型inputs: 字符串类型targets: 字符串类型
-
数据分割:
- 训练集 (
train):- 字节数: 7956437
- 样本数: 5070
- 验证集 (
validation):- 字节数: 413353
- 样本数: 300
- 训练集 (
-
数据集大小:
- 下载大小: 0
- 数据集大小: 8369790



