five

TweetSentBR

收藏
arXiv2017-12-24 更新2024-06-21 收录
下载链接:
http://bitbucket.org/HBrum/tweetsentbr/
下载链接
链接失效反馈
官方服务:
资源简介:
TweetSentBR是一个专为巴西葡萄牙语设计的情感分析数据集,由圣保罗大学数学与计算机科学研究所创建。该数据集包含15,000条从Twitter上提取的推文,专注于电视节目领域,每条推文均由七位标注者手动标注为正面、中性或负面情感。数据集的创建过程严格遵循标注指南,确保标注的可靠性。TweetSentBR不仅支持情感分类研究,还特别关注中性情感的标注,以更真实地反映社交媒体上的用户情感表达。该数据集适用于开发和评估新的机器学习模型,尤其是在深度学习和自然语言处理领域。

TweetSentBR is a sentiment analysis dataset designed specifically for Brazilian Portuguese, created by the Institute of Mathematics and Computer Science at the University of São Paulo. This dataset contains 15,000 tweets extracted from Twitter, focusing on the domain of television programs. Each tweet was manually annotated by seven annotators with three sentiment labels: positive, neutral, and negative. The dataset was developed in strict adherence to annotation guidelines to ensure annotation reliability. Beyond supporting sentiment classification research, TweetSentBR places special emphasis on neutral sentiment annotation, enabling a more realistic reflection of user emotional expressions on social media. This dataset is suitable for developing and evaluating novel machine learning models, particularly in the fields of deep learning and natural language processing.
提供机构:
圣保罗大学数学与计算机科学研究所
创建时间:
2017-12-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作