KETI-AIR/kor_ropes
Hugging Face | Updated 2023-11-15 | Indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/KETI-AIR/kor_ropes
Resource description:
---
pretty_name: ROPES
language:
- ko
license:
- cc-by-4.0
size_categories:
- 10K<n<100K
task_categories:
- question-answering
task_ids:
- extractive-qa
dataset_info:
  features:
  - name: data_index_by_user
    dtype: int32
  - name: background
    dtype: string
  - name: situation
    dtype: string
  - name: question
    dtype: string
  - name: answers
    sequence:
    - name: text
      dtype: string
  splits:
  - name: train
    num_bytes: 13608462
    num_examples: 10924
  - name: validation
    num_bytes: 1864822
    num_examples: 1688
  - name: test
    num_bytes: 2158508
    num_examples: 1710
  download_size: 1465973
  dataset_size: 17631792
---
# Dataset Card for ROPES
## Licensing Information
The data is distributed under the [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) license.
## Source Data Citation Information
```
@inproceedings{Lin2019ReasoningOP,
title={Reasoning Over Paragraph Effects in Situations},
author={Kevin Lin and Oyvind Tafjord and Peter Clark and Matt Gardner},
booktitle={MRQA@EMNLP},
year={2019}
}
```
Provider:
KETI-AIR
Summary of Original Information
Dataset Overview
Basic Information
- Name: ROPES
- Language: Korean
- License: CC BY 4.0
- Size category: 10K<n<100K
- Task category: question answering
- Task ID: extractive QA
Dataset Structure
Features
- data_index_by_user: int32
- background: string
- situation: string
- question: string
- answers: a sequence whose elements each contain
  - text: string
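The feature list above can be turned into a small schema check. The sketch below is illustrative only: `validate_record` and `EXPECTED_TYPES` are hypothetical names, the example record holds placeholder values rather than real data, and it assumes the `answers` sequence is exposed as a dictionary with a list under `text`, which is how the Hugging Face `datasets` library usually presents a sequence of named sub-features.

```python
# Expected Python type for each top-level field, per the feature list above.
EXPECTED_TYPES = {
    "data_index_by_user": int,
    "background": str,
    "situation": str,
    "question": str,
}


def validate_record(record: dict) -> list:
    """Return a list of schema problems; an empty list means the record fits."""
    problems = []
    for field, expected in EXPECTED_TYPES.items():
        if field not in record:
            problems.append(f"missing field: {field}")
        elif not isinstance(record[field], expected):
            problems.append(f"{field} should be {expected.__name__}")
    # Assumption: the `answers` sequence is exposed as {"text": [str, ...]}.
    answers = record.get("answers")
    if not isinstance(answers, dict) or not isinstance(answers.get("text"), list):
        problems.append("answers should be a dict with a list under 'text'")
    elif not all(isinstance(t, str) for t in answers["text"]):
        problems.append("answers.text should contain only strings")
    return problems


# Hypothetical record with placeholder values, not taken from the dataset.
example = {
    "data_index_by_user": 0,
    "background": "placeholder background passage",
    "situation": "placeholder situation passage",
    "question": "placeholder question",
    "answers": {"text": ["placeholder answer"]},
}
print(validate_record(example))
```

If the `datasets` library is installed, records obtained with `datasets.load_dataset("KETI-AIR/kor_ropes")` could presumably be passed through the same check, though that has not been verified here.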
Splits
- Train:
  - Bytes: 13608462
  - Examples: 10924
- Validation:
  - Bytes: 1864822
  - Examples: 1688
- Test:
  - Bytes: 2158508
  - Examples: 1710
Sizes
- Download size: 1465973 bytes
- Dataset size: 17631792 bytes
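The split and size figures above are internally consistent, which a short arithmetic check makes explicit. This is a sketch using only the numbers listed on this page; the variable names are illustrative. The three splits sum to 14,322 examples, which falls inside the stated 10K<n<100K size category, and their byte counts add up to the reported dataset size.

```python
# Figures copied from the split and size listings above.
SPLITS = {
    "train": {"num_bytes": 13608462, "num_examples": 10924},
    "validation": {"num_bytes": 1864822, "num_examples": 1688},
    "test": {"num_bytes": 2158508, "num_examples": 1710},
}
DATASET_SIZE = 17631792   # bytes
DOWNLOAD_SIZE = 1465973   # bytes

total_bytes = sum(s["num_bytes"] for s in SPLITS.values())
total_examples = sum(s["num_examples"] for s in SPLITS.values())

# The per-split byte counts should add up to the reported dataset size.
assert total_bytes == DATASET_SIZE
# The example count should fall inside the declared size category.
assert 10_000 < total_examples < 100_000

for name, s in SPLITS.items():
    share = s["num_examples"] / total_examples
    print(f"{name}: {s['num_examples']} examples ({share:.1%})")
print(f"total examples: {total_examples}")
print(f"download / dataset size ratio: {DOWNLOAD_SIZE / DATASET_SIZE:.3f}")
```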



