timaeus/pythia-160m-pile-1m-ig-l9h3
收藏Hugging Face2025-01-31 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/timaeus/pythia-160m-pile-1m-ig-l9h3
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个字段: contents字段为字符串类型,存储文本内容;metadata字段包含pile_set_name属性,为字符串序列;id字段为整型,唯一标识每个样本。数据集分为训练集,共有10000个样本,总大小为15793538字节。
The dataset includes three fields: the contents field is of string type, storing text content; the metadata field contains the pile_set_name attribute, which is a string sequence; the id field is an integer, uniquely identifying each sample. The dataset is divided into a training set with a total of 10,000 samples and a total size of 15,793,538 bytes.
提供机构:
timaeus



