five

Amina-Chouigui/ANTCorpusv2.1

收藏
Hugging Face2023-09-17 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Amina-Chouigui/ANTCorpusv2.1
下载链接
链接失效反馈
官方服务:
资源简介:
--- task_categories: - text-classification - summarization language: - ar size_categories: - 10K<n<100K --- # Dataset Card for ANTCorpus v2.1 ## Dataset Description - **Homepage:** https://antcorpus.github.io/ - **Point of Contact:** aminachouigui@gmail.com ### Dataset Summary ANTCorpus v2.1 (31 525 articles with multi-source Arabic news websites) ### Supported Tasks and Leaderboards Text classification and summarization. ### Languages Arabic ### Licensing Information By downloading ANT Corpus, you agree to cite at least one of our papers describing ANT Corpus and/or refer the project's main page in any kind of material you produce where ANT Corpus was used to conduct search or experimentation, whether be it a research paper, dissertation, article, poster, presentation, or documentation. 📄 A. Chouigui, O. Ben Khiroun, and B. Elayeb. An Arabic Multi-source News Corpus: Experimenting on Single-document Extractive Summarization. In Arabian Journal for Science and Engineering (AJSE 2021), 46(08), 1-14, DOI : 10.1007/s13369-020-05258-z , February 2021. 📄 A. Chouigui, O. Ben Khiroun, and B. Elayeb. ANT Corpus : An Arabic News Text Collection for Textual Classification. In proceedings of the 14th ACS/IEEE International Conference on Computer Systems and Applications (AICCSA 2017), pp. 135-142, Hammamet, Tunisia, October 30 - November 3, 2017. 📄 A. Chouigui, O. Ben Khiroun, and B. Elayeb. A TF-IDF and Co-occurrence Based Approach for Events Extraction from Arabic News Corpus. In proceedings of the 23rd International Conference on Natural Language & Information Systems (NLDB 2018), pp. 272-280, Paris, France, 13-15 June 2018. 📄 A. Chouigui, O. Ben Khiroun and B. Elayeb. Related Terms Extraction from Arabic News Corpus using Word Embedding. In: OTM Conferences & Workshops: Proceedings of the 7th International Workshop on Methods, Evaluation, Tools and Applications for the Creation and Consumption of Structured Data for the e-Society (Meta4eS'18), Springer, LNCS, pp. 1-11, Valletta, Malta, 22-26 October 2018.
提供机构:
Amina-Chouigui
原始信息汇总

数据集卡片 ANTCorpus v2.1

数据集描述

数据集概述

ANTCorpus v2.1 包含 31,525 篇来自多个阿拉伯新闻网站的文章。

支持的任务和排行榜

文本分类和摘要生成。

语言

阿拉伯语

许可信息

下载 ANT Corpus 后,您同意在任何使用 ANT Corpus 进行搜索或实验的材料中引用至少一篇描述 ANT Corpus 的论文,并在任何材料中提及项目的主页。

📄 A. Chouigui, O. Ben Khiroun, and B. Elayeb. An Arabic Multi-source News Corpus: Experimenting on Single-document Extractive Summarization. In Arabian Journal for Science and Engineering (AJSE 2021), 46(08), 1-14, DOI : 10.1007/s13369-020-05258-z , February 2021.

📄 A. Chouigui, O. Ben Khiroun, and B. Elayeb. ANT Corpus : An Arabic News Text Collection for Textual Classification. In proceedings of the 14th ACS/IEEE International Conference on Computer Systems and Applications (AICCSA 2017), pp. 135-142, Hammamet, Tunisia, October 30 - November 3, 2017.

📄 A. Chouigui, O. Ben Khiroun, and B. Elayeb. A TF-IDF and Co-occurrence Based Approach for Events Extraction from Arabic News Corpus. In proceedings of the 23rd International Conference on Natural Language & Information Systems (NLDB 2018), pp. 272-280, Paris, France, 13-15 June 2018.

📄 A. Chouigui, O. Ben Khiroun and B. Elayeb. Related Terms Extraction from Arabic News Corpus using Word Embedding. In: OTM Conferences & Workshops: Proceedings of the 7th International Workshop on Methods, Evaluation, Tools and Applications for the Creation and Consumption of Structured Data for the e-Society (Meta4eS18), Springer, LNCS, pp. 1-11, Valletta, Malta, 22-26 October 2018.

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作