TweEvent: A dataset of Twitter messages about events in the Ukraine conflict

Source: DataCite Commons. Updated: 2023-03-22. Indexed: 2024-07-13.
Download link:
https://mediatum.ub.tum.de/1703244
Description:
Information about incidents within a conflict, e.g., the shelling of an area of interest, is scattered among different data and media sources. For example, the ACLED dataset continuously documents local incidents recorded in the context of a specific conflict, such as Russia’s war in Ukraine. However, these records may be incomplete, so it is useful to collect data from several sources to enrich the information available about a given incident. In this paper, we present a dataset of social media messages covering the same war events as those collected in the ACLED dataset. The information is extracted from automatically geocoded Twitter text data using state-of-the-art natural language processing methods based on large pre-trained language models (LMs). Our method can be applied to various textual data sources. Both the data and the approach can help human analysts obtain a broader understanding of conflict events.
Provided by:
Technical University of Munich
Created:
2023-03-22