AfriSenti
Source: arXiv. Updated 2023-11-05; indexed 2024-06-21.
Download link:
https://github.com/afrisenti-semeval/afrisent-semeval-2023
Resource description:
AfriSenti is a sentiment analysis benchmark dataset for African languages, created by a research team at the University of Porto. It contains more than 110,000 tweets in 14 African languages spanning four major language families. The tweets were annotated by native speakers and were used in the AfriSenti-SemEval shared task. Building the dataset involved substantial data collection and annotation challenges; its goal is to advance sentiment analysis research for African languages and address their underrepresentation in natural language processing.
Provider:
University of Porto
Created:
2023-02-17



