KETI-AIR/kor_aeslc
收藏Hugging Face2023-12-05 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/KETI-AIR/kor_aeslc
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: validation
path: data/validation-*
- split: test
path: data/test-*
dataset_info:
features:
- name: data_index_by_user
dtype: int32
- name: subject_line
dtype: string
- name: email_body
dtype: string
splits:
- name: train
num_bytes: 11852680
num_examples: 14436
- name: validation
num_bytes: 1699457
num_examples: 1960
- name: test
num_bytes: 1539749
num_examples: 1906
download_size: 8634308
dataset_size: 15091886
license: cc-by-nc-sa-4.0
---
# Dataset Card for "kor_aeslc"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
# Source Data Citation Information
```
@inproceedings{zhang-tetreault-2019-email,
title = "This Email Could Save Your Life: Introducing the Task of Email Subject Line Generation",
author = "Zhang, Rui and
Tetreault, Joel",
booktitle = "Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics",
month = jul,
year = "2019",
address = "Florence, Italy",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/P19-1043",
doi = "10.18653/v1/P19-1043",
pages = "446--456",
}
```
提供机构:
KETI-AIR
原始信息汇总
数据集概述
数据集配置
- 默认配置:
- 训练集:路径为
data/train-* - 验证集:路径为
data/validation-* - 测试集:路径为
data/test-*
- 训练集:路径为
数据集信息
-
特征:
data_index_by_user:数据类型为int32subject_line:数据类型为stringemail_body:数据类型为string
-
数据分割:
- 训练集:
- 字节数:11852680
- 样本数:14436
- 验证集:
- 字节数:1699457
- 样本数:1960
- 测试集:
- 字节数:1539749
- 样本数:1906
- 训练集:
-
数据大小:
- 下载大小:8634308 字节
- 数据集大小:15091886 字节
许可证
- 许可证:cc-by-nc-sa-4.0
数据集引用
@inproceedings{zhang-tetreault-2019-email, title = "This Email Could Save Your Life: Introducing the Task of Email Subject Line Generation", author = "Zhang, Rui and Tetreault, Joel", booktitle = "Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics", month = jul, year = "2019", address = "Florence, Italy", publisher = "Association for Computational Linguistics", url = "https://aclanthology.org/P19-1043", doi = "10.18653/v1/P19-1043", pages = "446--456", }



