Arabic Text Corpus (Raw, Unfiltered)
Source: DataCite Commons | Updated: 2025-05-01 | Indexed: 2024-07-29
Download link:
https://figshare.com/articles/dataset/Arabic_Text_Copus_Raw_Unfiltered_/21605772/1
Description:
The corpus is extracted from 29,192,662 ClueWeb HTML pages. Each text file contains, on average, 30,000 different webpages. The size of the corpus is 18,482,719 terms.
Provider:
figshare
Created:
2022-11-22



