timaeus/pile-enron_emails-elimination-disjoint-slm-l1sae1139
收藏Hugging Face2025-03-18 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/timaeus/pile-enron_emails-elimination-disjoint-slm-l1sae1139
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含文本和元数据信息的训练数据集,其中文本数据类型为字符串,元数据包含一个名为pile_set_name的字段。数据集分为训练集,共有35103个样本,数据集大小为65711128.94982字节。
This is a training dataset containing text and metadata, where the text data type is string, and the metadata includes a field named pile_set_name. The dataset is split into a training set with a total of 35,103 samples, and the dataset size is 65,711,128.94982 bytes.
提供机构:
timaeus



