five

PolitiCause: An Annotation Scheme and Corpus for Causality in Political Texts

收藏
DataCite Commons2025-06-12 更新2025-04-16 收录
下载链接:
https://refubium.fu-berlin.de/handle/fub188/46047
下载链接
链接失效反馈
官方服务:
资源简介:
An Annotation Scheme and Corpus for Causality in Political Text. PolitiCAUSE is a corpus of political texts annotated for causality. We provide two types of information: (1) whether a sentence contains a causal relation or not (2) the spans of text that correspond to the cause and effect components of the causal relation. The dataset is available in two ways: (1) As a full dataset containing all annotations and statistics for 55,754 annotation instances. (2) As a train, validation and test splits containing the text and the label of 17,780 unique sentences. We benchmarked the dataset using three transformer-based classification models, the models achieve a moderate performance on the dataset, with a MCC score of 0.62. PolitiCAUSE is a valuable resource for studying causality in texts, especially in the domain of political discourse.
提供机构:
Freie Universität Berlin
创建时间:
2024-12-20
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作