tyzhu/squad_title_v4_train_30_eval_10
Source: Hugging Face · Updated: 2023-09-26 · Indexed: 2024-03-04
Download link:
https://hf-mirror.com/datasets/tyzhu/squad_title_v4_train_30_eval_10
Resource description:
---
dataset_info:
  features:
  - name: id
    dtype: string
  - name: title
    dtype: string
  - name: context
    dtype: string
  - name: question
    dtype: string
  - name: answers
    sequence:
    - name: text
      dtype: string
    - name: answer_start
      dtype: int32
  - name: context_id
    dtype: string
  - name: inputs
    dtype: string
  - name: targets
    dtype: string
  splits:
  - name: train
    num_bytes: 555104
    num_examples: 368
  - name: validation
    num_bytes: 50807
    num_examples: 50
  download_size: 105632
  dataset_size: 605911
---
# Dataset Card for "squad_title_v4_train_30_eval_10"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
The dataset has eight features: id, title, context, question, answers, context_id, inputs, and targets. All are strings except answers, which is a sequence with the sub-features text (string) and answer_start (int32). The dataset is split into a training set of 368 examples and a validation set of 50 examples. The download size is 105632 bytes, and the dataset size is 605911 bytes.
Provider:
tyzhu
Summary of original information
Dataset overview
Dataset name
- Name: squad_title_v4_train_30_eval_10
Dataset features
- Feature list:
  - id: string
  - title: string
  - context: string
  - question: string
  - answers: sequence with the following sub-features:
    - text: string
    - answer_start: 32-bit integer
  - context_id: string
  - inputs: string
  - targets: string
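The feature list above can be turned into a small record check. The sketch below is only illustrative: the record is invented (it is not taken from the dataset), `check_record` is a hypothetical helper, and it assumes two conventions inherited from SQuAD that the card itself does not state, namely that `answers` arrives as a dict of parallel lists and that `answer_start` is a character offset into `context`.

```python
# Names of the features declared as plain strings in the card.
STRING_FEATURES = ("id", "title", "context", "question",
                   "context_id", "inputs", "targets")
INT32_MIN, INT32_MAX = -2**31, 2**31 - 1


def check_record(record):
    """Return a list of problems; an empty list means the record fits the schema."""
    problems = []
    for name in STRING_FEATURES:
        if not isinstance(record.get(name), str):
            problems.append(f"{name}: expected a string")

    context = record.get("context")
    if not isinstance(context, str):
        context = ""
    answers = record.get("answers") or {}
    texts = answers.get("text", [])
    starts = answers.get("answer_start", [])
    if len(texts) != len(starts):
        problems.append("answers: text and answer_start differ in length")

    for text, start in zip(texts, starts):
        if not isinstance(text, str):
            problems.append("answers.text: expected a string")
        elif not isinstance(start, int) or not INT32_MIN <= start <= INT32_MAX:
            problems.append("answers.answer_start: expected an int32")
        # Assumption: answer_start is a character offset into context.
        elif context[start:start + len(text)] != text:
            problems.append(f"answers: {text!r} not found at offset {start}")
    return problems


# Hypothetical record; every value here is made up for illustration.
CONTEXT = "The quick brown fox jumps over the lazy dog."
ANSWER = "the lazy dog"
record = {
    "id": "example-0",
    "title": "Example_Title",
    "context": CONTEXT,
    "question": "What does the fox jump over?",
    "answers": {"text": [ANSWER], "answer_start": [CONTEXT.index(ANSWER)]},
    "context_id": "ctx-0",
    "inputs": "placeholder input string",
    "targets": "placeholder target string",
}

print(check_record(record))  # prints []
```

The card does not document how inputs and targets are built from the other fields, so the check only verifies that they are strings.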
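Dataset splits
- Training set:
  - Name: train
  - Size in bytes: 555104
  - Number of examples: 368
- Validation set:
  - Name: validation
  - Size in bytes: 50807
  - Number of examples: 50
Dataset size
- Download size: 105632 bytes
- Dataset size: 605911 bytes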
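As a quick sanity check, the split and size figures above can be cross-checked with a few lines of arithmetic. This is a minimal sketch using only the numbers from the card; the variable names are illustrative, and reading download_size as the size of the downloaded files and dataset_size as the size of the loaded data is an assumption about Hugging Face metadata rather than something the card states.

```python
# Figures copied from the card's metadata.
SPLITS = {
    "train": {"num_bytes": 555104, "num_examples": 368},
    "validation": {"num_bytes": 50807, "num_examples": 50},
}
DOWNLOAD_SIZE = 105632
DATASET_SIZE = 605911

total_bytes = sum(s["num_bytes"] for s in SPLITS.values())
total_examples = sum(s["num_examples"] for s in SPLITS.values())

# The per-split byte counts should add up to the reported dataset size.
print(total_bytes == DATASET_SIZE)  # prints True
print(total_examples)               # prints 418

# Average size of one example in each split, in bytes.
for name, s in SPLITS.items():
    print(name, round(s["num_bytes"] / s["num_examples"], 2))

# Ratio of download size to dataset size (assumed to reflect compression).
ratio = DOWNLOAD_SIZE / DATASET_SIZE
print(round(ratio, 3))
```

The two split sizes sum exactly to the reported dataset size, so the metadata is internally consistent.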



