Arabic language Web pages dataset
收藏Figshare2017-01-29 更新2026-04-08 收录
下载链接:
https://figshare.com/articles/dataset/Arabic_Sample/4588702/2
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains 7,976 URIs with content determined to be in the Arabic language. The URIs were collected from 1) the Arabic DMOZ listing, 2) Raddadi, a well-known Arabic directory, and 3) Star28, an Arabic directory. All 7,976 URIs were available on the live Web as of January 2014.<br><b>This data is used and further described in the journal article:</b>Lulwah M. Alkwai, Michael L. Nelson, and Michele C. Weigle. 2017. Comparing the Archival Rate of Arabic, English, Danish, and Korean Language Web Pages. ACM Transactions on Information Systems (TOIS).<br><b>This work was an extension of the paper:</b>Lulwah M. Alkwai, Michael L. Nelson, and Michele C. Weigle. 2015. How Well Are Arabic Websites Archived?. In Proceedings of the 15th IEEE/ACM Joint Conference on Digital Libraries (JCDL). ACM<br>
提供机构:
Lulwah Alkwai
创建时间:
2017-01-27



