A Richly Annotated Corpus for Different Tasks in Automated Fact-Checking

Name: A Richly Annotated Corpus for Different Tasks in Automated Fact-Checking
Creator: Ubiquitous Knowledge Processing Lab (UKP-TUDA)
Published: 2019-10-30 00:07:12
License: No description available

Source: arXiv | Updated: 2019-10-30 | Indexed: 2024-06-21

Download link:

https://tudatalib.ulb.tudarmstadt.de/handle/tudatalib/2081

Resource description:

The dataset 'A Richly Annotated Corpus for Different Tasks in Automated Fact-Checking' was created by the Ubiquitous Knowledge Processing Lab (UKP-TUDA). It contains 6,422 verified claims spanning multiple domains, such as discussion blogs, news, and social media, where unreliable information is frequently created and spread. The data was collected from the Snopes fact-checking website and annotated in detail by crowd workers. The corpus is intended to support the core tasks of the automated fact-checking pipeline (document retrieval, evidence extraction, stance detection, and claim verification) and thereby to support work on countering online misinformation.

Provider:

Ubiquitous Knowledge Processing Lab (UKP-TUDA)

Created:

2019-10-30
