timaeus/pile-wikipedia_en-elimination-disjoint-slm-l1sae83
收藏Hugging Face2025-03-18 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/timaeus/pile-wikipedia_en-elimination-disjoint-slm-l1sae83
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本和元数据信息,文本信息存储在text字段中,而元数据信息存储在meta字段中,其中meta字段包括一个名为pile_set_name的字符串字段。数据集分为训练集,共有15713个示例,总大小约为47190244.49264字节。数据集的下载大小为8744991字节。
The dataset includes text and metadata information, with the text stored in the text field and metadata in the meta field, which includes a string field named pile_set_name. The dataset is split into a training set with a total of 15713 examples and a size of approximately 47190244.49264 bytes. The download size of the dataset is 8744991 bytes.
提供机构:
timaeus



