Generalized Deception Dataset
Source: NIAID Data Ecosystem (indexed 2026-05-01)
Download link:
https://zenodo.org/record/6512467
Description:
We took labeled datasets from five different deception-detection tasks with no licensing issues and converted them to a standard format. We inspected each dataset for quality and generated new cleaned versions.
Task                  # Deceptive  # Truthful
Product Reviews             10493       10481
Phishing                     6134        9202
Job Scams                     608       13735
Political Statements         5669        7167
Fake News                   27486       34615
Our data is structured as five JSON Lines files (one for each task). Each line is a JSON object with a text to classify and a Boolean is_deceptive label.
Sample data point:
{
"text":"the Annies List political group supports third-trimester abortions on demand.",
"is_deceptive":true
}
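The record format above can be read with the Python standard library alone. The following is a minimal sketch, not code shipped with the dataset: the helper names read_jsonl and label_counts are hypothetical, and the second record in the in-memory sample is a made-up placeholder used only to exercise the false label.

```python
import io
import json
from collections import Counter


def read_jsonl(stream):
    """Yield one record per non-empty line of a JSON Lines stream."""
    for line_number, line in enumerate(stream, start=1):
        line = line.strip()
        if not line:
            continue  # tolerate blank lines, e.g. a trailing newline
        record = json.loads(line)
        # Check the two fields described above: a string and a Boolean.
        if not isinstance(record.get("text"), str) or not isinstance(
            record.get("is_deceptive"), bool
        ):
            raise ValueError(f"line {line_number}: unexpected record shape")
        yield record


def label_counts(records):
    """Count records per is_deceptive value (True / False)."""
    return Counter(record["is_deceptive"] for record in records)


# In-memory stand-in for one of the task files. The first record is the
# sample data point from the description; the second is an invented
# placeholder, not taken from the dataset.
sample = io.StringIO(
    '{"text": "the Annies List political group supports third-trimester '
    'abortions on demand.", "is_deceptive": true}\n'
    '{"text": "placeholder truthful statement", "is_deceptive": false}\n'
)

records = list(read_jsonl(sample))
counts = label_counts(records)
print("deceptive:", counts[True], "truthful:", counts[False])
```

To read a real task file, pass an open file handle instead of the in-memory sample, for example open(path, encoding="utf-8"), where path points to whichever of the five files was downloaded. The actual file names are not listed here, so none is assumed.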
Changelog
1.1
Fixed flipped labels in the job scams dataset.
Created:
2023-09-23



