tasksource/mctest-nli
Source: Hugging Face. Updated 2024-06-03; indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/tasksource/mctest-nli
Resource description:
---
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: test
    path: data/test-*
  - split: validation
    path: data/validation-*
dataset_info:
  features:
  - name: label
    dtype: string
  - name: premise
    dtype: string
  - name: hypothesis
    dtype: string
  splits:
  - name: train
    num_bytes: 6945271
    num_examples: 5920
  - name: test
    num_bytes: 3804719
    num_examples: 3360
  - name: validation
    num_bytes: 1477453
    num_examples: 1280
  download_size: 731098
  dataset_size: 12227443
---
# Dataset Card for "mctest-nli"
```
@inproceedings{richardson-etal-2013-mctest,
title = "{MCT}est: A Challenge Dataset for the Open-Domain Machine Comprehension of Text",
author = "Richardson, Matthew and
Burges, Christopher J.C. and
Renshaw, Erin",
editor = "Yarowsky, David and
Baldwin, Timothy and
Korhonen, Anna and
Livescu, Karen and
Bethard, Steven",
booktitle = "Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing",
month = oct,
year = "2013",
address = "Seattle, Washington, USA",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/D13-1020",
pages = "193--203",
}
```
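The metadata above declares a single `default` config with three splits and three string columns. The following is a minimal sketch of loading the dataset and checking it against those declared values. It assumes the Hugging Face `datasets` library is installed and the Hub is reachable. The helper names `find_size_mismatches` and `load_and_check` are hypothetical and not part of the dataset or the library.

```python
from typing import Dict, Optional, Tuple

# Row counts and column names as declared in the card's `dataset_info` metadata.
EXPECTED_NUM_EXAMPLES: Dict[str, int] = {"train": 5920, "test": 3360, "validation": 1280}
EXPECTED_COLUMNS = ("label", "premise", "hypothesis")


def find_size_mismatches(actual: Dict[str, int]) -> Dict[str, Tuple[int, Optional[int]]]:
    """Return {split: (expected, actual)} for every split whose row count differs."""
    return {
        split: (expected, actual.get(split))
        for split, expected in EXPECTED_NUM_EXAMPLES.items()
        if actual.get(split) != expected
    }


def load_and_check(repo_id: str = "tasksource/mctest-nli"):
    """Download the dataset and compare it with the card metadata (needs network)."""
    # Imported lazily so the pure-Python helper above works without the library.
    from datasets import load_dataset

    dataset = load_dataset(repo_id)  # DatasetDict keyed by split name
    actual = {split: ds.num_rows for split, ds in dataset.items()}
    mismatches = find_size_mismatches(actual)
    if mismatches:
        raise ValueError(f"Split sizes differ from the card: {mismatches}")
    for split, ds in dataset.items():
        missing = set(EXPECTED_COLUMNS) - set(ds.column_names)
        if missing:
            raise ValueError(f"{split} is missing columns: {sorted(missing)}")
    return dataset
```

Calling `load_and_check()` returns the loaded splits if they match the card, and raises otherwise. The card does not list the values that `label` takes, so the sketch checks only row counts and column names.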
Provider:
tasksource
Summary of original information
Dataset overview
Configuration
- Default config (`config_name: default`) includes the following data files:
  - Training set (`split: train`): path `data/train-*`
  - Test set (`split: test`): path `data/test-*`
  - Validation set (`split: validation`): path `data/validation-*`
Dataset information
- Features:
  - `label`: string
  - `premise`: string
  - `hypothesis`: string
- Split details:
  - Training set (`name: train`): 6945271 bytes, 5920 examples
  - Test set (`name: test`): 3804719 bytes, 3360 examples
  - Validation set (`name: validation`): 1477453 bytes, 1280 examples
- Download size: 731098 bytes
- Total dataset size: 12227443 bytes
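The size figures above can be cross-checked with simple arithmetic. The sketch below copies the values from the card and verifies that the per-split byte counts add up to the declared total. It also derives each split's share of the examples. The variable names are illustrative only.

```python
# Values copied from the card's `dataset_info` metadata.
SPLITS = {
    "train": {"num_bytes": 6945271, "num_examples": 5920},
    "test": {"num_bytes": 3804719, "num_examples": 3360},
    "validation": {"num_bytes": 1477453, "num_examples": 1280},
}
DOWNLOAD_SIZE = 731098
DATASET_SIZE = 12227443

total_bytes = sum(s["num_bytes"] for s in SPLITS.values())
total_examples = sum(s["num_examples"] for s in SPLITS.values())

# The declared dataset size should equal the sum of the per-split byte counts.
sizes_consistent = total_bytes == DATASET_SIZE

# Fraction of all examples that falls into each split.
split_share = {name: s["num_examples"] / total_examples for name, s in SPLITS.items()}

# Ratio of the download size to the declared dataset size.
download_ratio = DOWNLOAD_SIZE / DATASET_SIZE

print(f"total examples: {total_examples}")
print(f"byte counts consistent: {sizes_consistent}")
for name, share in split_share.items():
    print(f"{name}: {share:.1%} of examples")
print(f"download / dataset size: {download_ratio:.3f}")
```

The per-split byte counts sum exactly to the declared 12227443 bytes, and the three splits hold 10560 examples in total. The download is roughly 6% of the declared dataset size. That would be consistent with a compressed storage format, although the card does not state one.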



