timaeus/pile-github-elimination-disjoint-slm-l1sae998
收藏Hugging Face2025-03-18 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/timaeus/pile-github-elimination-disjoint-slm-l1sae998
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本和元数据两个特征,文本为字符串类型,元数据中包含pile_set_name字段。数据集分为训练集,共有42,539个示例,总大小为230,755,489.28551字节。
The dataset includes two features: text and metadata, with the text feature being of string type, and the metadata containing a pile_set_name field. The dataset is split into a training set with a total of 42,539 examples and a total size of 230,755,489.28551 bytes.
提供机构:
timaeus



