PPN Dataset
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/hybrinfox/ppn
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是 “Propagandist Pseudo-News”(PPN)数据集,包含多个由 VIGINUM 认定的俄罗斯控制的新闻网站上发布的文章,这些网站旨在传播亲俄叙事或在西方国内制造极端对立。该数据集囊括了包括 Reliable Recent News、Tribunal Ukraine、War on Fakes、Notre Pays、La Virgule 等站点的文章,覆盖多种语言(如英语、法语、俄语、德语、西班牙语、中文等)。每篇文章都带有元数据,比如作者、下载日期、发布日期、标题、语言、域名、正文内容、摘要、图片URL 等字段。这个数据集可以用于研究宣传文风、伪新闻识别、跨语言极化内容检测与风格分析等任务。
This dataset is the "Propagandist Pseudo-News" (PPN) dataset, which contains articles published on multiple Russian-controlled news websites identified by VIGINUM. These websites aim to disseminate pro-Russian narratives or incite extreme polarization within Western domestic societies. The dataset covers articles from platforms including Reliable Recent News, Tribunal Ukraine, War on Fakes, Notre Pays, La Virgule and other similar sites, supporting multiple languages such as English, French, Russian, German, Spanish, Chinese and more. Each article is accompanied by comprehensive metadata fields including author, download date, publication date, title, language, domain name, main text content, abstract, image URL and other relevant information. This dataset can be applied to research tasks such as propagandistic writing style analysis, pseudo-news recognition, cross-lingual polarized content detection and stylistic analysis.
提供机构:
hybrinfox



