five

Generalized Deception Dataset

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/6512467
下载链接
链接失效反馈
官方服务:
资源简介:
We took labeled datasets from five different deception-detection tasks with no licensing issues and converted them to a standard format. We inspected each dataset for quality and generated new cleaned versions.  Task # Deceptive # Truthful Product Reviews 10493 10481 Phishing 6134 9202 Job Scams 608 13735 Political Statements 5669 7167 Fake News 27486 34615 Our data is structured as five jsonlines files (one for each task) with a text to classify and a Boolean is_deceptive label.  Sample data point:  { "text":"the Annies List political group supports third-trimester abortions on demand.", "is_deceptive":true }   Changelog 1.1 Fixed flipped labels in the job scams dataset.
创建时间:
2023-09-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作