five

ExHalder Dataset

收藏
arXiv2025-09-30 收录
下载链接:
https://bit.ly/exhalder-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含了6270个人工精选的新闻文章和标题示例,这些示例被标记用于检测虚构内容。其中,有1934个示例被标记为“虚构”,4336个被标记为“蕴含”,还有2074个示例附有评分员撰写的额外评论。数据集按规模分为5190个训练示例、349个验证示例和731个测试示例,其任务是标题虚构内容检测。

This dataset comprises 6,270 manually curated news article and headline examples annotated for fictional content detection. Of these, 1,934 examples are annotated as "fictional", 4,336 as "entailed", and an additional 2,074 examples are accompanied by extra comments written by annotators. The dataset is split into 5,190 training examples, 349 validation examples, and 731 test examples, with the core task being fictional content detection for news headlines.
提供机构:
Authors of the paper
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作