christti/squad-augmented-v2
收藏Hugging Face2023-11-29 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/christti/squad-augmented-v2
下载链接
链接失效反馈官方服务:
资源简介:
---
pretty_name: SQuAD Augmented v2
license: cc-by-4.0
task_categories:
- question-answering
source_datasets:
- extended|wikipedia
task_ids:
- extractive-qa
annotations_creators:
- crowdsourced
language_creators:
- crowdsourced
- found
paperswithcode_id: squad
language:
- en
multilinguality:
- monolingual
size_categories:
- 10K<n<100K
viewer: true
train-eval-index:
- config: plain_text
task: question-answering
task_id: extractive_question_answering
splits:
train_split: train
eval_split: validation
col_mapping:
question: question
context: context
answers:
text: text
answer_start: answer_start
metrics:
- type: squad
name: SQuAD
dataset_info:
features:
- name: id
dtype: string
- name: title
dtype: string
- name: context
dtype: string
- name: question
dtype: string
- name: answers
sequence:
- name: text
dtype: string
- name: answer_start
dtype: int32
config_name: plain_text
splits:
- name: train
num_bytes: 156093315
num_examples: 169211
- name: validation
num_bytes: 10472653
num_examples: 10570
download_size: 35142551
dataset_size: 89789763
---
提供机构:
christti
原始信息汇总
SQuAD Augmented v2 数据集概述
基本信息
- 名称: SQuAD Augmented v2
- 许可证: CC BY 4.0
- 任务类别: 问答
- 源数据集: 扩展自 Wikipedia
- 任务标识: 抽取式问答
- 语言: 英语
- 多语言性: 单语种
- 数据集大小类别: 10K<n<100K
创建者信息
- 标注创建者: 众包
- 语言创建者: 众包、现有资源
数据集详情
- 配置名称: plain_text
- 分割:
- 训练集:
- 名称: train
- 字节数: 156093315
- 样本数: 169211
- 验证集:
- 名称: validation
- 字节数: 10472653
- 样本数: 10570
- 训练集:
- 下载大小: 35142551
- 数据集大小: 89789763
特征信息
- 特征:
- id: 字符串
- title: 字符串
- context: 字符串
- question: 字符串
- answers: 序列
- text: 字符串
- answer_start: 整数 (int32)
训练与评估
- 配置: plain_text
- 任务: 问答
- 任务标识: 抽取式问答
- 分割:
- 训练分割: train
- 评估分割: validation
- 列映射:
- question: question
- context: context
- answers:
- text: text
- answer_start: answer_start
- 评估指标:
- 类型: squad
- 名称: SQuAD



