
tasksource/mctest-nli

Source: Hugging Face. Updated 2024-06-03; indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/tasksource/mctest-nli
Resource description:
---
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: test
    path: data/test-*
  - split: validation
    path: data/validation-*
dataset_info:
  features:
  - name: label
    dtype: string
  - name: premise
    dtype: string
  - name: hypothesis
    dtype: string
  splits:
  - name: train
    num_bytes: 6945271
    num_examples: 5920
  - name: test
    num_bytes: 3804719
    num_examples: 3360
  - name: validation
    num_bytes: 1477453
    num_examples: 1280
  download_size: 731098
  dataset_size: 12227443
---
# Dataset Card for "mctest-nli"

```
@inproceedings{richardson-etal-2013-mctest,
    title = "{MCT}est: A Challenge Dataset for the Open-Domain Machine Comprehension of Text",
    author = "Richardson, Matthew and Burges, Christopher J.C. and Renshaw, Erin",
    editor = "Yarowsky, David and Baldwin, Timothy and Korhonen, Anna and Livescu, Karen and Bethard, Steven",
    booktitle = "Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing",
    month = oct,
    year = "2013",
    address = "Seattle, Washington, USA",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/D13-1020",
    pages = "193--203",
}
```

Provider:
tasksource
Summary of original information

Dataset overview

Configuration

  • The default configuration (config_name: default) comprises the following data files:
    • Training set (split: train): path data/train-*
    • Test set (split: test): path data/test-*
    • Validation set (split: validation): path data/validation-*
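
The configuration above could be loaded with the Hugging Face `datasets` library along the following lines. This is a minimal sketch, assuming the `datasets` package is installed and the Hub (or a mirror) is reachable; the helper name `load_mctest_nli` is hypothetical and not part of the dataset card.

```python
REPO_ID = "tasksource/mctest-nli"          # dataset identifier from this page
SPLITS = ("train", "test", "validation")   # splits declared in the default config


def load_mctest_nli(split):
    """Load one split of the default configuration (hypothetical helper)."""
    if split not in SPLITS:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLITS}")
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, split=split)
```

Calling `load_mctest_nli("train")` should return a dataset whose records carry the `label`, `premise`, and `hypothesis` fields; validating the split name first avoids a network round trip on a typo.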

Dataset information

  • Features
    • label: dtype string
    • premise: dtype string
    • hypothesis: dtype string
  • Split details
    • Training set (name: train):
      • Size: 6945271 bytes
      • Examples: 5920
    • Test set (name: test):
      • Size: 3804719 bytes
      • Examples: 3360
    • Validation set (name: validation):
      • Size: 1477453 bytes
      • Examples: 1280
  • Download size: 731098 bytes
  • Total dataset size: 12227443 bytes
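
The split figures above can be cross-checked with a little arithmetic. The sketch below hard-codes the numbers from the dataset card (it does not download anything) and confirms that the per-split byte counts sum to the reported total; the variable names are illustrative only.

```python
# Per-split metadata copied from the dataset card.
SPLITS = {
    "train": {"num_bytes": 6_945_271, "num_examples": 5_920},
    "test": {"num_bytes": 3_804_719, "num_examples": 3_360},
    "validation": {"num_bytes": 1_477_453, "num_examples": 1_280},
}
DATASET_SIZE = 12_227_443  # reported total dataset size in bytes

total_bytes = sum(info["num_bytes"] for info in SPLITS.values())
total_examples = sum(info["num_examples"] for info in SPLITS.values())

# Fraction of all examples that falls into each split.
shares = {name: info["num_examples"] / total_examples for name, info in SPLITS.items()}

print(total_bytes == DATASET_SIZE)  # True: split sizes add up to the reported total
print(total_examples)               # 10560
for name, share in shares.items():
    print(f"{name}: {share:.1%}")
```

The three splits hold 10560 examples in total, divided roughly 56% / 32% / 12% between train, test, and validation.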
