ClaimsKG - A Knowledge Graph of Fact-Checked Claims (August, 2022)
收藏CESSDA2023-03-31 更新2024-08-03 收录
下载链接:
https://datacatalogue.cessda.eu/detail?lang=en&q=8662fbeb8de20ed35654de4a4f75d3ff527b895690a783aad002e84384dfd317
下载链接
链接失效反馈官方服务:
资源简介:
ClaimsKG is a knowledge graph of metadata information for 59580 fact-checked claims scraped from 13 fact-checking sites. In addition to providing a single dataset of claims and associated metadata, truth ratings are harmonised and additional information is provided for each claim, e.g., about mentioned entities. Please see (https://data.gesis.org/claimskg/) for further details about the data model and statistics.
The dataset facilitates structured queries about claims, their truth values, involved entities, authors, dates, and other kinds of metadata. ClaimsKG is generated through a (semi-)automated pipeline, which harvests claim-related data from popular fact-checking web sites, annotates them with related entities from DBpedia/Wikipedia, and lifts all data to RDF using established vocabularies (such as schema.org).
The latest release of ClaimsKG covers 59580 claims. The data was scraped till August, of 2022 containing claims published between the years 1996-2022 from 13 factchecking websites. The claim-review (fact checking) period for claims ranges between the year 1996 to 2022. Entity fishing python client (https://github.com/hirmeos/entity-fishing-client-python) has been used for entity linking and disambiguation in this release. The dataset contains a total of 1371271 entities detected and referenced with DBpedia. More information, such as detailed statistics, query examples and a user-friendly interface to explore the knowledge graph is available at: https://data.gesis.org/claimskg/ .
The first two releases of ClaimsKG are hosted at Zenodo (https://doi.org/10.5281/zenodo.3518960), ClaimsKGV1.0 (published on 04.04.2019), ClaimsKGV2.0 (published on 01.09.2019). This latest release of ClaimsKG supersedes the previous versions as it contains all the claims from the previous versions together with additional claims as well as improved entity annotations.
ClaimsKG是一款知识图谱(knowledge graph),收纳了从13家事实核查网站爬取得到的59580条经事实核查声明的元数据信息。本数据集不仅提供了声明及其关联元数据的单一集合,还对真实性评级进行了统一规范,并为每条声明补充了额外信息,例如其所涉及的实体。如需了解数据模型与统计详情,请访问https://data.gesis.org/claimskg/。
该数据集支持针对声明、其真实性取值、涉及实体、作者、发布日期及其他类型元数据的结构化查询。ClaimsKG通过半自动化流水线生成:该流水线从热门事实核查网站采集与声明相关的数据,使用DBpedia/维基百科的关联实体完成数据标注,并依托schema.org等通用词汇表将所有数据转换为RDF格式。
本次ClaimsKG的最新版本共涵盖59580条声明。数据爬取工作截至2022年8月,收录了1996年至2022年间由13家事实核查网站发布的全部声明,相关声明的事实核查周期同样覆盖1996年至2022年。本版本中,实体链接与消歧任务采用了Entity Fishing Python客户端(https://github.com/hirmeos/entity-fishing-client-python)完成。该数据集总计检测并引用了1371271个关联DBpedia的实体。更多信息,包括详细统计数据、查询示例以及用于探索该知识图谱的友好型用户界面,均可通过https://data.gesis.org/claimskg/ 获取。
ClaimsKG的前两个版本托管于Zenodo平台(https://doi.org/10.5281/zenodo.3518960),分别为2019年4月4日发布的ClaimsKGV1.0,以及2019年9月1日发布的ClaimsKGV2.0。本次最新版本取代了此前所有版本,不仅包含了过往版本的全部声明,还新增了更多声明,并优化了实体标注流程。
提供机构:
GESIS Data Archive for the Social Sciences
搜集汇总
数据集介绍

以上内容由遇见数据集搜集并总结生成



