Amina-Chouigui/ANTCorpusv2.1
收藏数据集卡片 ANTCorpus v2.1
数据集描述
数据集概述
ANTCorpus v2.1 包含 31,525 篇来自多个阿拉伯新闻网站的文章。
支持的任务和排行榜
文本分类和摘要生成。
语言
阿拉伯语
许可信息
下载 ANT Corpus 后,您同意在任何使用 ANT Corpus 进行搜索或实验的材料中引用至少一篇描述 ANT Corpus 的论文,并在任何材料中提及项目的主页。
📄 A. Chouigui, O. Ben Khiroun, and B. Elayeb. An Arabic Multi-source News Corpus: Experimenting on Single-document Extractive Summarization. In Arabian Journal for Science and Engineering (AJSE 2021), 46(08), 1-14, DOI : 10.1007/s13369-020-05258-z , February 2021.
📄 A. Chouigui, O. Ben Khiroun, and B. Elayeb. ANT Corpus : An Arabic News Text Collection for Textual Classification. In proceedings of the 14th ACS/IEEE International Conference on Computer Systems and Applications (AICCSA 2017), pp. 135-142, Hammamet, Tunisia, October 30 - November 3, 2017.
📄 A. Chouigui, O. Ben Khiroun, and B. Elayeb. A TF-IDF and Co-occurrence Based Approach for Events Extraction from Arabic News Corpus. In proceedings of the 23rd International Conference on Natural Language & Information Systems (NLDB 2018), pp. 272-280, Paris, France, 13-15 June 2018.
📄 A. Chouigui, O. Ben Khiroun and B. Elayeb. Related Terms Extraction from Arabic News Corpus using Word Embedding. In: OTM Conferences & Workshops: Proceedings of the 7th International Workshop on Methods, Evaluation, Tools and Applications for the Creation and Consumption of Structured Data for the e-Society (Meta4eS18), Springer, LNCS, pp. 1-11, Valletta, Malta, 22-26 October 2018.



