mimiklee/masterthesis-longt5-sum
Source: Hugging Face. Updated 2024-06-27. Indexed 2024-06-29.
Download link:
https://hf-mirror.com/datasets/mimiklee/masterthesis-longt5-sum
Description:
The dataset has two features, abstract and article, both of string type. It is divided into training, validation, and test sets: the training set contains 68,243 samples, and the validation and test sets contain 14,624 samples each. The download size is 1,798,318,065 bytes, and the total size is 3,489,001,539 bytes.
Provider:
mimiklee
Summary of original information
Dataset overview
Features
- abstract: string.
- article: string.
Data splits
- train:
  - Bytes: 2440240550
  - Samples: 68243
- validation:
  - Bytes: 527509526
  - Samples: 14624
- test:
  - Bytes: 521251463
  - Samples: 14624
Dataset size
- Download size: 1798318065 bytes
- Total size: 3489001539 bytes
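The per-split figures can be cross-checked against the reported totals with a few lines of arithmetic. The sketch below only restates numbers from this listing; the names SPLITS and REPORTED_TOTAL_BYTES are illustrative and not part of the dataset.

```python
# Figures copied from the listing above; variable names are illustrative only.
SPLITS = {
    "train": {"bytes": 2_440_240_550, "samples": 68_243},
    "validation": {"bytes": 527_509_526, "samples": 14_624},
    "test": {"bytes": 521_251_463, "samples": 14_624},
}
REPORTED_TOTAL_BYTES = 3_489_001_539

# Sum the per-split byte counts and sample counts.
total_bytes = sum(s["bytes"] for s in SPLITS.values())
total_samples = sum(s["samples"] for s in SPLITS.values())

print("bytes match reported total:", total_bytes == REPORTED_TOTAL_BYTES)
for name, s in SPLITS.items():
    share = s["samples"] / total_samples      # fraction of all samples in this split
    avg_kib = s["bytes"] / s["samples"] / 1024  # mean record size in KiB
    print(f"{name}: {share:.1%} of samples, about {avg_kib:.1f} KiB per sample")
```

The three byte counts add up exactly to the reported total size, and the sample counts (97,491 in all) correspond to roughly a 70/15/15 split.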
Configuration
- config_name: default
- data_files:
  - train: data/train-*
  - validation: data/validation-*
  - test: data/test-*
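A minimal sketch of reading one record with the Hugging Face datasets library, assuming the repository is still publicly available under the ID shown in the download link and that the third-party datasets package is installed. The helper first_example and the constants are hypothetical names introduced for illustration. Streaming is used so that the full download of about 1.8 GB is not required.

```python
REPO_ID = "mimiklee/masterthesis-longt5-sum"  # taken from the download link above
SPLIT_NAMES = ("train", "validation", "test")  # splits listed in the default config
FEATURES = ("abstract", "article")             # both features are strings


def first_example(split="validation"):
    """Return the first record of a split, streamed from the Hub."""
    if split not in SPLIT_NAMES:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLIT_NAMES}")
    # Imported here so the module loads even without the package installed.
    from datasets import load_dataset  # third-party: pip install datasets

    # streaming=True iterates over the remote files instead of downloading them all.
    stream = load_dataset(REPO_ID, split=split, streaming=True)
    return next(iter(stream))


# Example use (needs network access):
#   example = first_example("train")
#   for name in FEATURES:
#       print(name, len(example[name]), "characters")
```

Only the default configuration is listed, so no configuration name is passed explicitly here.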



