CUTEly MAD Dataset
Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://data.mendeley.com/datasets/hhksb972pp
Description:
CUTEly MAD (short for Curated Twitter Malayalam Dataset) is a curated dataset for document-level sentiment analysis in the Malayalam language. It was created by extracting Malayalam tweets from Twitter. A set of positive and negative sentiment-oriented Malayalam words was identified, and these words were used as hashtags to extract tweets through the Twitter API. A proficient annotator then manually labeled each tweet by sentiment polarity into one of two classes: positive (labeled 1) or negative (labeled 0). In total, 2,000 tweets are labeled, of which 50% are positive and 50% are negative.
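The labeling scheme described above (1 for positive, 0 for negative, split evenly) can be sanity-checked after download. The following is a minimal sketch, not part of the dataset's own tooling: it assumes the data is available as CSV text with a column named `label`, which is a hypothetical name since the listing does not state the file layout. The sample rows are invented placeholders, not real tweets.

```python
import csv
import io
from collections import Counter


def label_balance(csv_text, label_column="label"):
    """Count rows per sentiment label (1 = positive, 0 = negative).

    `label_column` is an assumed column name; adjust it to the actual file.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    counts = Counter(int(row[label_column]) for row in reader)
    return {"positive": counts[1], "negative": counts[0]}


# Made-up placeholder rows, used only to show the call.
sample = "text,label\nexample tweet a,1\nexample tweet b,0\n"
balance = label_balance(sample)
print(balance)
```

If the description holds, running this on the full file should report 1,000 rows for each class.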
Created:
2024-01-29



