five

CUTEly MAD Dataset

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://data.mendeley.com/datasets/hhksb972pp
下载链接
链接失效反馈
官方服务:
资源简介:
CUTEly MAD is a curated dataset for document-level sentiment analysis in the Malayalam language. CUTEly MAD is short for Curated Twitter Malayalam Dataset. The dataset is created by extracting Malayalam tweets from Twitter. A set of both positive and negative sentiment-oriented Malayalam words are identified, which are used as hashtags to extract tweets using Twitter API. Further, these tweets were manually labeled by a proficient annotator, based on their sentiment polarity into two classes, viz. negative and positive. If the sentiment is positive, then 1 is annotated. Otherwise, 0 is labeled for negative sentiment. A total of 2,000 tweets are labeled, where 50% are positive tweets and the other 50% are negative sentiment oriented.
创建时间:
2024-01-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作