five

Unveiling Global Narratives: A Multilingual Twitter Dataset of News Media on the Russo-Ukrainian Conflict

收藏
arXiv2024-04-08 更新2024-06-21 收录
下载链接:
https://github.com/sherzod-hakimov/ru-ua-news-discourse-twitter
下载链接
链接失效反馈
官方服务:
资源简介:
本数据集名为‘揭秘全球叙事:俄乌冲突新闻媒体多语种Twitter数据集’,由波茨坦大学语言学系计算语言学创建。该数据集收集了2022年2月至2023年5月期间,全球新闻媒体发布的约152万条推文,涵盖60种语言及相应图片。创建过程中,研究者通过Twitter API和Wikidata筛选并处理数据,确保每条推文包含处理后的标签,如实体识别、立场、文本或视觉概念及情感分析。此数据集旨在为研究全球对俄乌冲突的叙事提供丰富资源,适用于媒体研究、冲突分析及国际关系等领域,帮助理解不同地区和文化如何感知和报道此冲突。

This dataset is named 'Unveiling Global Narratives: A Multilingual Twitter Dataset of News Media Coverage on the Russia-Ukraine Conflict', which was created by the Computational Linguistics Team, Department of Linguistics, University of Potsdam. The dataset contains approximately 1.52 million tweets published by global news media between February 2022 and May 2023, covering 60 languages along with their associated images. During its development, researchers filtered and processed the data via the Twitter API and Wikidata, ensuring that each tweet includes annotated labels such as entity recognition results, stance, textual or visual concepts, and sentiment analysis outcomes. This dataset aims to provide a rich resource for researching global narratives surrounding the Russia-Ukraine conflict, and is applicable to fields including media studies, conflict analysis and international relations, to facilitate understanding of how different regions and cultures perceive and report on this conflict.
提供机构:
波茨坦大学语言学系计算语言学
创建时间:
2023-06-22
二维码
社区交流群
二维码
科研交流群
商业服务