A Twitter-based Arabic Mental Health Disorder (MHD) Dataset

Source: DataCite Commons. Updated: 2023-02-17. Indexed: 2024-07-13.
Download link:
https://napier-repository.worktribe.com/Output/3027591
Description:
Sentiment classification is a dominant task in sentiment analysis, and it requires a large annotated corpus for training models. Manual annotation is the optimal technique for building such a corpus, but it is a time-consuming, extensive process that is also prone to human bias. In this paper, we introduce automatic annotation for a Twitter-based Arabic Mental Health Disorder (MHD) dataset by employing transfer learning. We used existing manually annotated datasets together with three state-of-the-art Arabic language models. To validate the MHD dataset, we also annotated it manually and measured the agreement between the manual and automatic annotations using Cohen's Kappa statistic. The MHD dataset achieves a Cohen's Kappa of κ = 0.85, which indicates strong agreement between the two annotation approaches. In addition, we evaluated several baseline models and report their results.
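The agreement check described above can be illustrated with a minimal sketch of Cohen's Kappa, computed as (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and p_e is the agreement expected by chance. The function name `cohen_kappa`, the label names, and the toy label lists are assumptions made up for illustration; they are not taken from the MHD dataset or from the authors' code.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label sequences must be non-empty and of equal length")
    n = len(labels_a)

    # Observed agreement: fraction of items where both annotations match.
    p_o = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement, from each annotation source's marginal label counts.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    categories = freq_a.keys() | freq_b.keys()
    p_e = sum(freq_a[c] * freq_b[c] for c in categories) / (n * n)

    if p_e == 1.0:
        # Both sources used one identical label throughout; kappa is
        # undefined here, and this sketch returns 1.0 by convention.
        return 1.0
    return (p_o - p_e) / (1 - p_e)


# Toy labels invented for illustration only (not MHD data).
manual = ["pos", "neg", "neg", "pos", "neu", "neg", "pos", "neu"]
automatic = ["pos", "neg", "neg", "pos", "neu", "pos", "pos", "neu"]

kappa = cohen_kappa(manual, automatic)
print(f"kappa = {kappa:.3f}")
```

In practice, a library routine such as scikit-learn's `cohen_kappa_score` would typically be used instead of a hand-written function; the sketch only spells out the arithmetic behind the statistic.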
Provider:
Edinburgh Napier University
Created:
2023-02-17