kuklinmike/wikipedia_ru
收藏Hugging Face2024-11-25 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/kuklinmike/wikipedia_ru
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个主要字段:id(唯一标识符)、url(资源链接)、title(标题)和text(文本内容)。数据集被分割为训练集(train),包含1,520,810个样本,总大小为7,632,869,201.84字节,下载大小为4,473,154,243字节。数据集的文件路径为data/train-*。
The dataset contains four main fields: id (unique identifier), url (resource link), title (title), and text (text content). The dataset is split into a training set (train) with 1,520,810 samples, a total size of 7,632,869,201.84 bytes, and a download size of 4,473,154,243 bytes. The file path for the dataset is data/train-*.
提供机构:
kuklinmike



