
tasksource/goal-step-wikihow

Source: Hugging Face. Updated 2024-05-31. Indexed 2024-06-15.
Download link:
https://hf-mirror.com/datasets/tasksource/goal-step-wikihow
Resource description:
---
license: mit
dataset_info:
- config_name: goal
  features:
  - name: video-id
    dtype: string
  - name: fold-ind
    dtype: int64
  - name: startphrase
    dtype: string
  - name: sent1
    dtype: string
  - name: sent2
    dtype: string
  - name: gold-source
    dtype: string
  - name: ending0
    dtype: string
  - name: ending1
    dtype: string
  - name: ending2
    dtype: string
  - name: ending3
    dtype: string
  - name: label
    dtype: int64
  splits:
  - name: train
    num_bytes: 47919020
    num_examples: 185231
  - name: test
    num_bytes: 398512
    num_examples: 1703
  download_size: 30309018
  dataset_size: 48317532
- config_name: order
  features:
  - name: video-id
    dtype: string
  - name: fold-ind
    dtype: int64
  - name: startphrase
    dtype: string
  - name: sent1
    dtype: string
  - name: sent2
    dtype: string
  - name: gold-source
    dtype: string
  - name: ending0
    dtype: string
  - name: ending1
    dtype: string
  - name: label
    dtype: int64
  splits:
  - name: train
    num_bytes: 161457398
    num_examples: 836128
  - name: test
    num_bytes: 535305
    num_examples: 3100
  download_size: 0
  dataset_size: 161992703
- config_name: step
  features:
  - name: video-id
    dtype: string
  - name: fold-ind
    dtype: int64
  - name: startphrase
    dtype: string
  - name: sent1
    dtype: string
  - name: sent2
    dtype: string
  - name: gold-source
    dtype: string
  - name: ending0
    dtype: string
  - name: ending1
    dtype: string
  - name: ending2
    dtype: string
  - name: ending3
    dtype: string
  - name: label
    dtype: int64
  splits:
  - name: train
    num_bytes: 117840509
    num_examples: 374278
  - name: test
    num_bytes: 640583
    num_examples: 2250
  download_size: 76438559
  dataset_size: 118481092
configs:
- config_name: goal
  data_files:
  - split: train
    path: goal/train-*
  - split: test
    path: goal/test-*
- config_name: order
  data_files:
  - split: train
    path: order/train-*
  - split: test
    path: order/test-*
- config_name: step
  data_files:
  - split: train
    path: step/train-*
  - split: test
    path: step/test-*
---

https://github.com/zharry29/wikihow-goal-step

```
@inproceedings{zhang-etal-2020-reasoning,
    title = "Reasoning about Goals, Steps, and Temporal Ordering with {W}iki{H}ow",
    author = "Zhang, Li and Lyu, Qing and Callison-Burch, Chris",
    booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
    month = nov,
    year = "2020",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://www.aclweb.org/anthology/2020.emnlp-main.374",
    pages = "4630--4639",
}
```
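
The card declares three configurations (`goal`, `order`, `step`), each with a `train` and a `test` split. A minimal sketch of loading one of them with the Hugging Face `datasets` library follows. The helper name `load_config` and the constants are illustrative assumptions, not part of the dataset, and the actual download needs network access and the `datasets` package installed.

```python
from typing import Any

# Repository ID and configuration/split names as listed in the dataset card.
REPO_ID = "tasksource/goal-step-wikihow"
CONFIGS = ("goal", "order", "step")
SPLITS = ("train", "test")


def load_config(config: str, split: str = "test") -> Any:
    """Load one configuration/split from the Hugging Face Hub.

    `load_config` is a hypothetical convenience wrapper; it only validates
    its arguments and then delegates to `datasets.load_dataset`.
    """
    if config not in CONFIGS:
        raise ValueError(f"unknown config {config!r}; expected one of {CONFIGS}")
    if split not in SPLITS:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLITS}")

    # Imported lazily so that argument validation works even when the
    # `datasets` package is not installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, config, split=split)
```

Calling `load_config("goal", "test")` would fetch the `goal` test split, which the card lists as having 1703 examples.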

Provided by:
tasksource
Summary of original information

Dataset overview

License

  • MIT License

Dataset configurations

Configuration name: goal

  • Features
    • video-id: string
    • fold-ind: int64
    • startphrase: string
    • sent1: string
    • sent2: string
    • gold-source: string
    • ending0: string
    • ending1: string
    • ending2: string
    • ending3: string
    • label: int64
  • Splits
    • train
      • Size in bytes: 47919020
      • Number of examples: 185231
    • test
      • Size in bytes: 398512
      • Number of examples: 1703
  • Download size: 30309018 bytes
  • Dataset size: 48317532 bytes
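
The feature list suggests a multiple-choice layout: several `ending*` candidates plus an integer `label`. The sketch below turns one row into a (prompt, choices, answer index) triple. It assumes that `startphrase` holds the prompt text and that `label` is the zero-based index of the correct ending; the page does not state either, so check both against real rows. The example row is invented for illustration and is not taken from the dataset.

```python
from typing import Any, List, Mapping, Tuple


def candidate_endings(row: Mapping[str, Any]) -> List[str]:
    """Collect ending0, ending1, ... in index order, for as many as exist."""
    endings: List[str] = []
    i = 0
    while f"ending{i}" in row:
        endings.append(row[f"ending{i}"])
        i += 1
    return endings


def to_multiple_choice(row: Mapping[str, Any]) -> Tuple[str, List[str], int]:
    """Return (prompt, choices, answer_index) for one row.

    Assumption: `label` indexes into the ending columns, starting at 0.
    """
    endings = candidate_endings(row)
    label = int(row["label"])
    if not 0 <= label < len(endings):
        raise ValueError(f"label {label} is out of range for {len(endings)} endings")
    return row["startphrase"], endings, label


# Hypothetical row using the column names of the `goal` configuration.
example_row = {
    "startphrase": "Fold the paper in half lengthwise.",
    "ending0": "Bake a cake",
    "ending1": "Change a tire",
    "ending2": "Make a paper airplane",
    "ending3": "Plant a tree",
    "label": 2,
}

prompt, choices, answer = to_multiple_choice(example_row)
```

Because the endings are discovered by name, the same helper also handles rows with only `ending0` and `ending1`.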

Configuration name: order

  • Features
    • video-id: string
    • fold-ind: int64
    • startphrase: string
    • sent1: string
    • sent2: string
    • gold-source: string
    • ending0: string
    • ending1: string
    • label: int64
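  • Splits
    • train
      • Size in bytes: 161457398
      • Number of examples: 836128
    • test
      • Size in bytes: 535305
      • Number of examples: 3100
  • Download size: 0 bytes (as reported in the dataset card)
  • Dataset size: 161992703 bytes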

Configuration name: step

  • Features
    • video-id: string
    • fold-ind: int64
    • startphrase: string
    • sent1: string
    • sent2: string
    • gold-source: string
    • ending0: string
    • ending1: string
    • ending2: string
    • ending3: string
    • label: int64
  • Splits
    • train
      • Size in bytes: 117840509
      • Number of examples: 374278
    • test
      • Size in bytes: 640583
      • Number of examples: 2250
  • Download size: 76438559 bytes
  • Dataset size: 118481092 bytes
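
The per-split byte counts listed for each configuration can be cross-checked against the reported dataset size. The sketch below copies the numbers from this page into a plain dictionary (the structure and the names `CARD_METADATA`, `sizes_consistent`, and `total_examples` are illustrative assumptions) and verifies that train plus test bytes equal the dataset size.

```python
from typing import Dict

# Values copied from the configuration listings: (num_bytes, num_examples).
CARD_METADATA = {
    "goal": {
        "splits": {"train": (47919020, 185231), "test": (398512, 1703)},
        "dataset_size": 48317532,
    },
    "order": {
        "splits": {"train": (161457398, 836128), "test": (535305, 3100)},
        "dataset_size": 161992703,
    },
    "step": {
        "splits": {"train": (117840509, 374278), "test": (640583, 2250)},
        "dataset_size": 118481092,
    },
}


def sizes_consistent(metadata: dict) -> Dict[str, bool]:
    """For each config, check that split byte counts sum to dataset_size."""
    return {
        name: sum(num_bytes for num_bytes, _ in cfg["splits"].values())
        == cfg["dataset_size"]
        for name, cfg in metadata.items()
    }


def total_examples(metadata: dict, split: str) -> int:
    """Sum the example counts of one split across all configurations."""
    return sum(cfg["splits"][split][1] for cfg in metadata.values())


consistency = sizes_consistent(CARD_METADATA)
train_total = total_examples(CARD_METADATA, "train")
test_total = total_examples(CARD_METADATA, "test")
```

All three configurations pass this check with the numbers shown on this page.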

Data file paths

  • Configuration name: goal
    • train: goal/train-*
    • test: goal/test-*
  • Configuration name: order
    • train: order/train-*
    • test: order/test-*
  • Configuration name: step
    • train: step/train-*
    • test: step/test-*
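
Each configuration maps its splits to files through glob-style patterns such as `goal/train-*`. As a rough illustration, the sketch below uses Python's `fnmatch` to decide which (configuration, split) pair a repository path belongs to. The file name in the usage line is hypothetical, and the Hub's own pattern resolution may differ in detail from `fnmatch`.

```python
from fnmatch import fnmatchcase
from typing import Dict, Optional, Tuple

# (config, split) -> glob pattern, as listed under "Data file paths".
DATA_FILES: Dict[Tuple[str, str], str] = {
    ("goal", "train"): "goal/train-*",
    ("goal", "test"): "goal/test-*",
    ("order", "train"): "order/train-*",
    ("order", "test"): "order/test-*",
    ("step", "train"): "step/train-*",
    ("step", "test"): "step/test-*",
}


def classify(path: str) -> Optional[Tuple[str, str]]:
    """Return the (config, split) whose pattern matches `path`, else None."""
    for key, pattern in DATA_FILES.items():
        if fnmatchcase(path, pattern):
            return key
    return None


# Hypothetical shard name, used only to exercise the patterns.
match = classify("step/test-00000-of-00001.parquet")
```

Paths that match none of the patterns, such as a top-level README, return `None`.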