AlgorithmicResearchGroup/minipile
Source: Hugging Face. Updated: 2024-08-20. Indexed: 2024-12-14.
Download link:
https://hf-mirror.com/datasets/AlgorithmicResearchGroup/minipile
Description:
The dataset has a single feature, text, of type string. It is split into training, validation, and test sets containing 1,000,000, 500, and 10,000 examples respectively. The training set is 5,906,108,510 bytes, the validation set is 2,779,386 bytes, and the test set is 58,558,191 bytes. The three splits total 5,967,446,087 bytes; the download size is 3,176,294,664 bytes.
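The figures in the description can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original listing: the names SPLITS, REPORTED_TOTAL_BYTES, REPORTED_DOWNLOAD_BYTES, and summarize are hypothetical, and the only inputs are the numbers quoted above. It sums the split sizes, compares the result with the reported total, and derives the average bytes per example and the ratio of download size to dataset size.

```python
# Split statistics copied from the description above.
# All names here are illustrative, not part of any library.
SPLITS = {
    "train": {"examples": 1_000_000, "bytes": 5_906_108_510},
    "validation": {"examples": 500, "bytes": 2_779_386},
    "test": {"examples": 10_000, "bytes": 58_558_191},
}
REPORTED_TOTAL_BYTES = 5_967_446_087
REPORTED_DOWNLOAD_BYTES = 3_176_294_664


def summarize(splits):
    """Return (total bytes, total examples, average bytes per example by split)."""
    total_bytes = sum(s["bytes"] for s in splits.values())
    total_examples = sum(s["examples"] for s in splits.values())
    avg_bytes = {name: s["bytes"] / s["examples"] for name, s in splits.items()}
    return total_bytes, total_examples, avg_bytes


total_bytes, total_examples, avg_bytes = summarize(SPLITS)

# Fraction of the dataset size that actually has to be downloaded.
download_ratio = REPORTED_DOWNLOAD_BYTES / REPORTED_TOTAL_BYTES

print("sum of splits matches reported total:", total_bytes == REPORTED_TOTAL_BYTES)
print("total examples:", total_examples)
for name, value in avg_bytes.items():
    print(f"average bytes per example ({name}): {value:.1f}")
print(f"download size / dataset size: {download_ratio:.3f}")
```

To work with the data itself rather than its metadata, the usual route would presumably be the Hugging Face datasets library, for example load_dataset("AlgorithmicResearchGroup/minipile"); that call needs network access and is not exercised in the sketch.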
Provided by:
AlgorithmicResearchGroup



