KathirKs/CC-MAIN-2015-27_row_wise_20240823_162945
收藏Hugging Face2024-08-23 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/KathirKs/CC-MAIN-2015-27_row_wise_20240823_162945
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本数据,每个数据项包括文本内容(text)、唯一标识符(uuid)和元数据(meta_data)。元数据中包含dump信息、文件路径、id和url。数据集分为训练集,包含1,393,077个样本,总大小为18,036,285,057字节,下载大小为6,787,873,593字节。
This dataset contains text data, with each data item including text content (text), a unique identifier (uuid), and metadata (meta_data). The metadata includes dump information, file path, id, and url. The dataset is divided into a training set, containing 1,393,077 samples, with a total size of 18,036,285,057 bytes and a download size of 6,787,873,593 bytes.
提供机构:
KathirKs



