tasksource/goal-step-wikihow
收藏Hugging Face2024-05-31 更新2024-06-15 收录
下载链接:
https://hf-mirror.com/datasets/tasksource/goal-step-wikihow
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
dataset_info:
- config_name: goal
features:
- name: video-id
dtype: string
- name: fold-ind
dtype: int64
- name: startphrase
dtype: string
- name: sent1
dtype: string
- name: sent2
dtype: string
- name: gold-source
dtype: string
- name: ending0
dtype: string
- name: ending1
dtype: string
- name: ending2
dtype: string
- name: ending3
dtype: string
- name: label
dtype: int64
splits:
- name: train
num_bytes: 47919020
num_examples: 185231
- name: test
num_bytes: 398512
num_examples: 1703
download_size: 30309018
dataset_size: 48317532
- config_name: order
features:
- name: video-id
dtype: string
- name: fold-ind
dtype: int64
- name: startphrase
dtype: string
- name: sent1
dtype: string
- name: sent2
dtype: string
- name: gold-source
dtype: string
- name: ending0
dtype: string
- name: ending1
dtype: string
- name: label
dtype: int64
splits:
- name: train
num_bytes: 161457398
num_examples: 836128
- name: test
num_bytes: 535305
num_examples: 3100
download_size: 0
dataset_size: 161992703
- config_name: step
features:
- name: video-id
dtype: string
- name: fold-ind
dtype: int64
- name: startphrase
dtype: string
- name: sent1
dtype: string
- name: sent2
dtype: string
- name: gold-source
dtype: string
- name: ending0
dtype: string
- name: ending1
dtype: string
- name: ending2
dtype: string
- name: ending3
dtype: string
- name: label
dtype: int64
splits:
- name: train
num_bytes: 117840509
num_examples: 374278
- name: test
num_bytes: 640583
num_examples: 2250
download_size: 76438559
dataset_size: 118481092
configs:
- config_name: goal
data_files:
- split: train
path: goal/train-*
- split: test
path: goal/test-*
- config_name: order
data_files:
- split: train
path: order/train-*
- split: test
path: order/test-*
- config_name: step
data_files:
- split: train
path: step/train-*
- split: test
path: step/test-*
---
https://github.com/zharry29/wikihow-goal-step
```
@inproceedings{zhang-etal-2020-reasoning,
title = "Reasoning about Goals, Steps, and Temporal Ordering with {W}iki{H}ow",
author = "Zhang, Li and
Lyu, Qing and
Callison-Burch, Chris",
booktitle = "Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
month = nov,
year = "2020",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/2020.emnlp-main.374",
pages = "4630--4639",
}
```
license: MIT许可证
数据集信息:
- config_name: 目标(goal)
特征:
- name: 视频ID(video-id),dtype: 字符串
- name: 折次索引(fold-ind),dtype: 64位整型
- name: 起始短语(startphrase),dtype: 字符串
- name: 句子1(sent1),dtype: 字符串
- name: 句子2(sent2),dtype: 字符串
- name: 黄金来源(gold-source),dtype: 字符串
- name: 结尾0(ending0),dtype: 字符串
- name: 结尾1(ending1),dtype: 字符串
- name: 结尾2(ending2),dtype: 字符串
- name: 结尾3(ending3),dtype: 字符串
- name: 标签(label),dtype: 64位整型
splits:
- name: 训练集(train),num_bytes: 47919020,num_examples: 185231
- name: 测试集(test),num_bytes: 398512,num_examples: 1703
download_size: 30309018
dataset_size: 48317532
- config_name: 顺序(order)
特征:
- name: 视频ID(video-id),dtype: 字符串
- name: 折次索引(fold-ind),dtype: 64位整型
- name: 起始短语(startphrase),dtype: 字符串
- name: 句子1(sent1),dtype: 字符串
- name: 句子2(sent2),dtype: 字符串
- name: 黄金来源(gold-source),dtype: 字符串
- name: 结尾0(ending0),dtype: 字符串
- name: 结尾1(ending1),dtype: 字符串
- name: 标签(label),dtype: 64位整型
splits:
- name: 训练集(train),num_bytes: 161457398,num_examples: 836128
- name: 测试集(test),num_bytes: 535305,num_examples: 3100
download_size: 0
dataset_size: 161992703
- config_name: 步骤(step)
特征:
- name: 视频ID(video-id),dtype: 字符串
- name: 折次索引(fold-ind),dtype: 64位整型
- name: 起始短语(startphrase),dtype: 字符串
- name: 句子1(sent1),dtype: 字符串
- name: 句子2(sent2),dtype: 字符串
- name: 黄金来源(gold-source),dtype: 字符串
- name: 结尾0(ending0),dtype: 字符串
- name: 结尾1(ending1),dtype: 字符串
- name: 结尾2(ending2),dtype: 字符串
- name: 结尾3(ending3),dtype: 字符串
- name: 标签(label),dtype: 64位整型
splits:
- name: 训练集(train),num_bytes: 117840509,num_examples: 374278
- name: 测试集(test),num_bytes: 640583,num_examples: 2250
download_size: 76438559
dataset_size: 118481092
configs:
- config_name: 目标(goal)
data_files:
- split: 训练集(train)
path: goal/train-*
- split: 测试集(test)
path: goal/test-*
- config_name: 顺序(order)
data_files:
- split: 训练集(train)
path: order/train-*
- split: 测试集(test)
path: order/test-*
- config_name: 步骤(step)
data_files:
- split: 训练集(train)
path: step/train-*
- split: 测试集(test)
path: step/test-*
https://github.com/zharry29/wikihow-goal-step
@inproceedings{zhang-etal-2020-reasoning,
title: "基于WikiHow的目标、步骤与时间顺序推理",
author: "张莉、吕清、克里斯·卡利森-伯奇",
booktitle: "2020年自然语言处理经验方法会议(EMNLP 2020)论文集",
month: "11月",
year: "2020",
address: "在线",
publisher: "计算语言学协会",
url: "https://www.aclweb.org/anthology/2020.emnlp-main.374",
pages: "4630--4639",
}
提供机构:
tasksource
原始信息汇总
数据集概述
许可证
- MIT许可证
数据集配置
配置名称:goal
- 特征
- video-id: string
- fold-ind: int64
- startphrase: string
- sent1: string
- sent2: string
- gold-source: string
- ending0: string
- ending1: string
- ending2: string
- ending3: string
- label: int64
- 分割
- train
- 字节数: 47919020
- 样本数: 185231
- test
- 字节数: 398512
- 样本数: 1703
- train
- 下载大小: 30309018
- 数据集大小: 48317532
配置名称:order
- 特征
- video-id: string
- fold-ind: int64
- startphrase: string
- sent1: string
- sent2: string
- gold-source: string
- ending0: string
- ending1: string
- label: int64
- 分割
- train
- 字节数: 161457398
- 样本数: 836128
- test
- 字节数: 535305
- 样本数: 3100
- train
- 下载大小: 0
- 数据集大小: 161992703
配置名称:step
- 特征
- video-id: string
- fold-ind: int64
- startphrase: string
- sent1: string
- sent2: string
- gold-source: string
- ending0: string
- ending1: string
- ending2: string
- ending3: string
- label: int64
- 分割
- train
- 字节数: 117840509
- 样本数: 374278
- test
- 字节数: 640583
- 样本数: 2250
- train
- 下载大小: 76438559
- 数据集大小: 118481092
数据文件路径
- 配置名称:goal
- train: goal/train-*
- test: goal/test-*
- 配置名称:order
- train: order/train-*
- test: order/test-*
- 配置名称:step
- train: step/train-*
- test: step/test-*



