timaeus/pythia-160m-pile-1m-ig-l1h0
收藏Hugging Face2025-01-31 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/timaeus/pythia-160m-pile-1m-ig-l1h0
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含文本内容和元数据的机器学习数据集,具体包括文本内容(contents)、堆集合名称(pile_set_name)和唯一标识符(id)。数据集分为训练集(train),共有10000个样本。整个数据集大小为16252039个字节,下载大小为10492715个字节。
This machine learning dataset includes text contents, metadata with pile set names, and unique identifiers. The dataset is split into a training set (train) with a total of 10,000 examples. The entire dataset is 16,252,039 bytes in size, with a download size of 10,492,715 bytes.
提供机构:
timaeus



