nroggendorff/corpus
Hugging Face | Updated 2024-11-22 | Indexed 2024-12-14
Download link:
https://hf-mirror.com/datasets/nroggendorff/corpus
Description:
The dataset has a single feature, text, of type string. It provides one split, train, containing 39,134,000 examples with a total size of 146,955,320,709 bytes (about 147 GB). The download size is 91,034,484,984 bytes (about 91 GB). The data files match the path pattern data/train-*.
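Because the download is about 91 GB, it may help to inspect a few records before fetching everything. The following is a minimal sketch, not an official loader: it assumes the Hugging Face `datasets` package is installed and that the Hub (or a mirror) is reachable. The helper names `average_bytes_per_example`, `download_to_dataset_ratio`, and `stream_texts` are hypothetical and introduced only for illustration; the constants are the figures quoted in the description.

```python
# Figures quoted in the dataset description.
NUM_EXAMPLES = 39_134_000
DATASET_SIZE_BYTES = 146_955_320_709
DOWNLOAD_SIZE_BYTES = 91_034_484_984


def average_bytes_per_example() -> float:
    """Mean size of one `text` record, derived from the stated totals."""
    return DATASET_SIZE_BYTES / NUM_EXAMPLES


def download_to_dataset_ratio() -> float:
    """Stated download size divided by the stated dataset size."""
    return DOWNLOAD_SIZE_BYTES / DATASET_SIZE_BYTES


def stream_texts(limit: int = 3):
    """Yield the first `limit` values of the `text` feature.

    Streaming avoids downloading every data/train-* file up front.
    Assumption: `datasets` is installed and the network is available.
    """
    from datasets import load_dataset  # imported lazily; only needed here

    stream = load_dataset("nroggendorff/corpus", split="train", streaming=True)
    for row in stream.take(limit):
        yield row["text"]


if __name__ == "__main__":
    print(f"average bytes per example: {average_bytes_per_example():.1f}")
    print(f"download / dataset size:   {download_to_dataset_ratio():.3f}")
```

From the stated totals, an average example is roughly 3.8 kB of text, and the download is about 0.62 of the full dataset size. Calling `list(stream_texts(3))` would fetch only the first few records rather than the whole split.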
Provider:
nroggendorff



