five

Arabic language Web pages dataset

收藏
Figshare2017-01-29 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/Arabic_Sample/4588702/2
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains 7,976 URIs with content determined to be in the Arabic language. The URIs were collected from 1) the Arabic DMOZ listing, 2) Raddadi, a well-known Arabic directory, and 3) Star28, an Arabic directory. All 7,976 URIs were available on the live Web as of January 2014.<br><b>This data is used and further described in the journal article:</b>Lulwah M. Alkwai, Michael L. Nelson, and Michele C. Weigle. 2017. Comparing the Archival Rate of Arabic, English, Danish, and Korean Language Web Pages. ACM Transactions on Information Systems (TOIS).<br><b>This work was an extension of the paper:</b>Lulwah M. Alkwai, Michael L. Nelson, and Michele C. Weigle. 2015. How Well Are Arabic Websites Archived?. In Proceedings of the 15th IEEE/ACM Joint Conference on Digital Libraries (JCDL). ACM<br>
提供机构:
Lulwah Alkwai
创建时间:
2017-01-27
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作