tsch00001/wikipedia-ar-small-mlm
Source: Hugging Face. Updated 2025-01-30; indexed 2025-02-15.
Download link:
https://hf-mirror.com/datasets/tsch00001/wikipedia-ar-small-mlm
Description:
The dataset contains text-type features and is divided into a training set and a test set. The training set contains 1,620,141 examples (173MB); the test set contains 16,104 examples (1.7MB). The total size of the dataset is approximately 170MB.
Provider:
tsch00001



