omarkamali/wikipedia-monthly
Source: Hugging Face. Updated: 2026-03-14. Indexed: 2025-08-30.
Download link:
https://hf-mirror.com/datasets/omarkamali/wikipedia-monthly
Resource summary:
The dataset consists of multiple language-specific subsets, each with a unique identifier. The features of each subset include id, title, url, text, namespace, and raw_mediawiki. Each subset provides the splits train, 1000, 5000, and 10000, and the number of examples and the byte size are reported for every split. Each subset also reports its dataset_size and download_size.
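As a rough illustration of the structure described above, the following sketch loads one subset and checks a record against the listed fields. It assumes the third-party Hugging Face `datasets` package is installed. The helper names `missing_fields` and `load_subset` are hypothetical, and the format of the subset identifier is not stated in this listing, so `config_name` has to be taken from the dataset card.

```python
from typing import Any, List, Mapping

# Field and split names as listed in the resource summary.
EXPECTED_FIELDS = ("id", "title", "url", "text", "namespace", "raw_mediawiki")
SPLITS = ("train", "1000", "5000", "10000")


def missing_fields(record: Mapping[str, Any]) -> List[str]:
    """Return the expected field names that are absent from a record."""
    return [name for name in EXPECTED_FIELDS if name not in record]


def load_subset(config_name: str, split: str = "1000"):
    """Load one split of one language-specific subset (hypothetical helper).

    `config_name` is the subset's unique identifier. Its exact format is an
    assumption left to the caller; look it up on the dataset card.
    """
    if split not in SPLITS:
        raise ValueError(f"unknown split {split!r}; expected one of {SPLITS}")
    # Imported here so the pure helpers above work without the package.
    from datasets import load_dataset  # third-party: pip install datasets

    return load_dataset("omarkamali/wikipedia-monthly", config_name, split=split)
```

A call such as `load_subset(config_name, split="1000")` should return a dataset object whose first record can be passed to `missing_fields` as a quick sanity check. Starting with one of the smaller named splits rather than `train` is likely the cheaper way to inspect a subset.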
Provider:
omarkamali



