roofdancer/presummarized_to_story_wcep
Source: Hugging Face. Updated 2024-05-25. Indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/roofdancer/presummarized_to_story_wcep
Resource description:
---
dataset_info:
  features:
  - name: document
    dtype: string
  - name: summary
    dtype: string
  splits:
  - name: train
    num_bytes: 11457142
    num_examples: 8158
  - name: validation
    num_bytes: 1511438
    num_examples: 1020
  - name: test
    num_bytes: 1600022
    num_examples: 1022
  download_size: 9003423
  dataset_size: 14568602
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: validation
    path: data/validation-*
  - split: test
    path: data/test-*
---
This dataset is primarily intended for document summarization. Each example has two string features, document and summary. The data is divided into training, validation, and test sets containing 8158, 1020, and 1022 examples respectively. The download size is 9003423 bytes, and the dataset size is 14568602 bytes.
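A minimal sketch of loading the dataset described above with the Hugging Face `datasets` library, assuming that library is installed and the repository is still publicly available. The helper name `load_wcep_story` and the `EXPECTED_EXAMPLES` table are illustrative and not part of the dataset; the counts are copied from the metadata on this page.

```python
# Repository ID as listed on this page.
REPO_ID = "roofdancer/presummarized_to_story_wcep"

# Example counts copied from the dataset card metadata.
EXPECTED_EXAMPLES = {"train": 8158, "validation": 1020, "test": 1022}


def load_wcep_story():
    """Download all splits and compare their sizes with the card (hypothetical helper)."""
    # Imported lazily so the constants above are usable without the library.
    from datasets import load_dataset

    dataset = load_dataset(REPO_ID)  # a DatasetDict keyed by split name
    for split, expected in EXPECTED_EXAMPLES.items():
        actual = dataset[split].num_rows
        if actual != expected:
            raise ValueError(f"{split}: expected {expected} rows, found {actual}")
    return dataset
```

If the repository matches its metadata, `load_wcep_story()["train"][0]` should be a dict with the keys `document` and `summary`, both strings.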
Provider:
roofdancer
Summary of original information
Dataset information
Features
- document: string
- summary: string
Splits
- train:
  - Bytes: 11457142
  - Examples: 8158
- validation:
  - Bytes: 1511438
  - Examples: 1020
- test:
  - Bytes: 1600022
  - Examples: 1022
Sizes
- Download size: 9003423 bytes
- Dataset size: 14568602 bytes
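The figures above can be cross-checked with a little arithmetic. The sketch below uses only the numbers reported on this page; the variable names are illustrative. It confirms that the per-split byte counts add up to the reported dataset size and prints each split's share of the examples.

```python
# Figures copied from the dataset metadata: (num_bytes, num_examples) per split.
SPLITS = {
    "train": (11457142, 8158),
    "validation": (1511438, 1020),
    "test": (1600022, 1022),
}
DATASET_SIZE = 14568602
DOWNLOAD_SIZE = 9003423

total_bytes = sum(num_bytes for num_bytes, _ in SPLITS.values())
total_examples = sum(num_examples for _, num_examples in SPLITS.values())

# The per-split byte counts should add up to the reported dataset size.
assert total_bytes == DATASET_SIZE

for name, (num_bytes, num_examples) in SPLITS.items():
    share = num_examples / total_examples
    avg_bytes = num_bytes / num_examples
    print(f"{name:<10} {share:6.1%} of examples, ~{avg_bytes:,.0f} bytes per example")

print(f"download/dataset size ratio: {DOWNLOAD_SIZE / DATASET_SIZE:.2f}")
```

The three splits total 10200 examples, which works out to roughly an 80/10/10 division.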
Configuration
- default config:
  - train: file path data/train-*
  - validation: file path data/validation-*
  - test: file path data/test-*
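To illustrate how glob patterns such as `data/train-*` assign files to splits, here is a small sketch using Python's standard `fnmatch` module. It is a stand-in for illustration, not the resolution logic the Hub itself uses, and the shard file names are hypothetical because the real file names are not given on this page.

```python
from fnmatch import fnmatch

# Patterns copied from the default config.
DATA_FILES = {
    "train": "data/train-*",
    "validation": "data/validation-*",
    "test": "data/test-*",
}

# Hypothetical shard names, for illustration only.
example_files = [
    "data/train-00000-of-00001.parquet",
    "data/validation-00000-of-00001.parquet",
    "data/test-00000-of-00001.parquet",
]


def split_for(path):
    """Return the split whose pattern matches path, or None if none match."""
    for split, pattern in DATA_FILES.items():
        if fnmatch(path, pattern):
            return split
    return None


# Map each example file to the split it would belong to.
assignments = {path: split_for(path) for path in example_files}
```

A file outside the `data/` prefix, such as `README.md`, matches none of the patterns and maps to `None`.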



