PANACEA
收藏arXiv2025-09-30 收录
下载链接:
https://doi.org/10.5281/zenodo.6493847
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个新颖的集合,包含了关于新冠病毒的异质化声明及其各自的信息来源,旨在为自动真实性评估提供一组独特的声明。该数据集中的声明被标记为真或假,来源于不同的渠道,如事实核查网站、健康信息网站和科学期刊。数据集分为两个版本,各自有不同的声明相似度阈值。规模方面,大数据集包含5,143条声明,而小数据集包含1,709条声明。其任务是进行声明真实性的评估。
This dataset is a novel collection of heterogeneous COVID-19-related claims and their respective information sources, designed to provide a unique corpus of claims for automated authenticity evaluation. All claims in the dataset are labeled as either true or false, and are sourced from diverse channels including fact-checking websites, health information platforms, and scientific journals. The dataset is offered in two versions, each with a distinct claim similarity threshold. In terms of scale, the large version contains 5,143 claims, while the small version contains 1,709 claims. The core task of this dataset is claim authenticity assessment.



