KETI-AIR/kor_quartz
收藏Hugging Face2023-11-15 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/KETI-AIR/kor_quartz
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- ko
license: cc-by-4.0
dataset_info:
features:
- name: data_index_by_user
dtype: int32
- name: question
dtype: string
- name: choices
struct:
- name: text
sequence: string
- name: label
sequence: string
- name: answerKey
dtype: string
- name: para
dtype: string
- name: para_id
dtype: string
- name: para_anno
struct:
- name: effect_prop
dtype: string
- name: cause_dir_str
dtype: string
- name: effect_dir_str
dtype: string
- name: cause_dir_sign
dtype: string
- name: effect_dir_sign
dtype: string
- name: cause_prop
dtype: string
- name: question_anno
struct:
- name: more_effect_dir
dtype: string
- name: less_effect_dir
dtype: string
- name: less_cause_prop
dtype: string
- name: more_effect_prop
dtype: string
- name: less_effect_prop
dtype: string
- name: less_cause_dir
dtype: string
splits:
- name: train
num_bytes: 1326969
num_examples: 2696
- name: validation
num_bytes: 192332
num_examples: 384
- name: test
num_bytes: 385569
num_examples: 784
download_size: 618394
dataset_size: 1904870
---
# Dataset Card for quartz
## Licensing Information
The data is distributed under the [CC BY 4.0](https://creativecommons.org/licenses/by/4.0/) license.
## Source Data Citation Information
```
@InProceedings{quartz,
author = {Oyvind Tafjord and Matt Gardner and Kevin Lin and Peter Clark},
title = {"QUARTZ: An Open-Domain Dataset of Qualitative Relationship
Questions"},
year = {"2019"},
}
提供机构:
KETI-AIR
原始信息汇总
数据集卡片 for quartz
数据集信息
特征
- data_index_by_user: 数据类型为
int32 - question: 数据类型为
string - choices: 结构体包含以下字段:
- text: 序列类型为
string - label: 序列类型为
string
- text: 序列类型为
- answerKey: 数据类型为
string - para: 数据类型为
string - para_id: 数据类型为
string - para_anno: 结构体包含以下字段:
- effect_prop: 数据类型为
string - cause_dir_str: 数据类型为
string - effect_dir_str: 数据类型为
string - cause_dir_sign: 数据类型为
string - effect_dir_sign: 数据类型为
string - cause_prop: 数据类型为
string
- effect_prop: 数据类型为
- question_anno: 结构体包含以下字段:
- more_effect_dir: 数据类型为
string - less_effect_dir: 数据类型为
string - less_cause_prop: 数据类型为
string - more_effect_prop: 数据类型为
string - less_effect_prop: 数据类型为
string - less_cause_dir: 数据类型为
string
- more_effect_dir: 数据类型为
数据分割
- train: 字节数为 1326969,样本数为 2696
- validation: 字节数为 192332,样本数为 384
- test: 字节数为 385569,样本数为 784
数据大小
- 下载大小: 618394 字节
- 数据集大小: 1904870 字节
许可信息
数据集遵循 CC BY 4.0 许可。
数据来源引用信息
@InProceedings{quartz, author = {Oyvind Tafjord and Matt Gardner and Kevin Lin and Peter Clark}, title = {"QUARTZ: An Open-Domain Dataset of Qualitative Relationship Questions"}, year = {"2019"}, }



