LingoIITGN/PoliWAM
Source: Hugging Face | Updated: 2025-08-18 | Indexed: 2025-08-09
Download link:
https://hf-mirror.com/datasets/LingoIITGN/PoliWAM
Description:
PoliWAM is a large-scale corpus of WhatsApp political discussions collected during the 2019 Indian general election. It contains both raw and annotated data, supporting research on political discourse, misinformation, propaganda, and multilingual code-mixing. The dataset comprises 223,000 messages from 281 public political groups, written by approximately 31,000 unique users. A subset of 3,848 messages is manually labeled for political orientation, linguistic composition, malicious intent, and inclination towards specific parties.
Provided by:
LingoIITGN



