five

FACTOID

收藏
arXiv2022-05-11 更新2024-06-21 收录
下载链接:
https://github.com/caisa-lab/FACTOID-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
FACTOID数据集由德国马尔堡大学的CAISA实验室创建,专注于分析Reddit上的政治讨论,特别是2020年初至今的虚假新闻传播者。该数据集包含超过4,150名用户和340万条Reddit帖子,不仅包括用户的二元标签,还详细记录了用户的可信度水平(从极低到极高)和政治偏见强度(从极右到极左)。这是首个同时捕捉用户历史帖子的长期上下文及其互动的虚假新闻传播者数据集。数据集的应用领域包括识别虚假信息传播者,通过分析用户的社会联系和心理语言特征,以及研究政治偏见对信息传播的影响,旨在解决社交媒体上的虚假信息问题。

The FACTOID dataset was developed by the CAISA Lab at the University of Marburg, Germany, with a focus on analyzing political discussions on Reddit, especially disinformation spreaders since early 2020. This dataset encompasses more than 4,150 users and 3.4 million Reddit posts. It not only provides binary labels for users but also records detailed information on their credibility levels (ranging from extremely low to extremely high) and the intensity of their political biases (spanning from far-right to far-left). As the first dataset of disinformation spreaders that simultaneously captures the long-term context of users’ historical posts and their social interactions, the FACTOID dataset can be applied to identify disinformation spreaders via analyzing users’ social connections and psycholinguistic features, as well as to investigate the impact of political bias on information dissemination, ultimately aiming to tackle the issue of disinformation on social media platforms.
提供机构:
马尔堡大学数学与计算机科学系
创建时间:
2022-05-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作