five

OPINAR: A Multi-Source Arabic Opinion Dataset with Keywords, 2004–2025

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://data.mendeley.com/datasets/nbmys7wnm2
下载链接
链接失效反馈
官方服务:
资源简介:
OPINAR is a large-scale Arabic opinion-article dataset covering 2004–2025, containing 280,890 articles, approximately X million tokens, and Y million unique word types, collected from 85 news websites across 21 countries (15 Arab and 6 non-Arab). Sources were selected primarily for providing keyword metadata. Each article includes publication date, source, ownership country, keywords/topics when available, and source URL. OPINAR is a large-scale Arabic opinion-article dataset covering 2004–2025, containing 280,885 articles, approximately 181 million tokens, and 3.1 million unique word types, collected from 84 news websites across 22 countries (16 Arab and 6 non-Arab). Sources were selected primarily for providing keyword metadata. Each article includes publication date, source, ownership country, keywords/topics when available, and source URL. The dataset is organized hierarchically by country → source → year → month, and each article is assigned a unique global identifier (OPN_XXXXXXXX). OPINAR supports research in opinion mining, sentiment analysis, discourse analysis, media studies, and Arabic NLP, offering extensive multi-source opinion content with transparent metadata.
创建时间:
2025-12-08
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作