five

CFEVER

收藏
arXiv2025-09-30 收录
下载链接:
https://github.com/awslabs/fever/tree/master/fever-annotations-platform
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集名为CFEVER,它遵循FEVER的标注方法构建而成,该方法涉及基于维基百科数据生成主张,并对这些主张标注为“支持”、“反驳”或“信息不足”。这些主张是由母语为中文的说话者生成的,数据集中包含了需要来自多个页面的证据的各种主张类型。该数据集包含了来自2022年12月版中文维基百科的1,187,751个页面,主张是基于访问量最高的页面生成的。其任务是为事实核实生成和标注主张。

This dataset is named CFEVER, which is constructed following the annotation methodology of the FEVER dataset. This methodology entails generating claims based on Wikipedia data and annotating these claims with three standard labels: "Supports", "Refutes", or "Not Enough Information". These claims are generated by native Chinese speakers, and the dataset encompasses diverse claim types that necessitate evidence from multiple Wikipedia pages. The dataset includes 1,187,751 pages sourced from the December 2022 edition of the Chinese Wikipedia, with the claims built upon the most frequently visited pages within this corpus. The core task of this dataset is to generate and annotate claims for fact verification.
提供机构:
Authors of the paper
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作