SudSenti2 and SudSenti3
Source: arXiv. Updated 2022-01-30. Indexed 2024-06-21.
Download link:
https://github.com/mustafa20999/Sudanese-Arabic-Sentiment-Datasets
Description:
This study introduces two new publicly available datasets: SudSenti2, a 2-class Sudanese sentiment dataset, and SudSenti3, a 3-class Sudanese sentiment dataset. SudSenti2 was collected from Facebook and YouTube and contains 4,000 instances labeled as positive or negative. SudSenti3 was collected from Twitter and contains 7,109 instances labeled as positive, negative, or neutral. Both datasets are intended to support sentiment analysis research on Sudanese Arabic, with the aim of using deep learning models to extract the best features and address sentiment classification in this specific context.
Provided by:
School of Information Science and Technology
Created:
2022-01-30



