five

KurdABSA: Aspect Based Sentiment Analysis Dataset for Kurdish Language

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://data.mendeley.com/datasets/h5t7p4bcj2
下载链接
链接失效反馈
官方服务:
资源简介:
Aspect-Based Sentiment Analysis (ABSA) extends traditional sentiment analysis by not only identifying the overall sentiment of a text but also associating specific sentiments with deeper and granular insights. The main objective of ABSA is to accurately extract relevant aspects and determine the sentiment polarity associated with each. Although extensive research has been conducted on ABSA across various languages, low-resource languages such as Kurdish remain largely underexplored in this domain. To address this gap, the present study introduces the first publicly available aspect-based sentiment analysis dataset for the Sorani dialect of Kurdish, addressing a critical gap in natural language processing (NLP) research for low-resource languages. The dataset has more than 4000 quadruplet ABSA in the restaurant review domain, written in the Kurdish language (Sorani dialect) using the Perso-Arabic script. A prompt-based few-shot learning model was employed to automatically annotate the dataset with aspect-opinion-category-sentiment quadruples, guided by a manually annotated support set verified by native Kurdish-language experts. This resource is intended for use in machine learning, deep learning, and cross-lingual model adaptation, making it suitable for training, fine-tuning, and benchmarking.
创建时间:
2026-03-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作