VeritasOmega/veritas-claim-verification-dataset

Name: VeritasOmega/veritas-claim-verification-dataset
Creator: VeritasOmega
Published: 2026-03-23 20:30:25
License: 暂无描述

Hugging Face2026-03-23 更新2026-03-29 收录

下载链接：

https://hf-mirror.com/datasets/VeritasOmega/veritas-claim-verification-dataset

下载链接

链接失效反馈

官方服务：

资源简介：

--- language: - en license: apache-2.0 task_categories: - text-classification task_ids: - fact-checking pretty_name: VERITAS Claim Verification Dataset size_categories: - 100K<n<1M --- # VERITAS Claim Verification Dataset **775,294 labeled verification calls** (770,000 synthetic + 5,294 adversarial) Multi-model consensus verdicts from the VERITAS Oracle across 8 domains. ## Paper [VERITAS: Independence-Weighted Multi-Model Consensus for AI Output Verification](https://doi.org/10.5281/zenodo.19187611) ## Adversarial Robustness Findings (NEW) 5,294 adversarial variants generated via deterministic transforms: | Transform | N | Accuracy | |---|---|---| | negation | 2,379 | **99.7%** | | jurisdiction | 114 | 100.0% | | quantifier | 337 | 98.8% | | number_perturb | 85 | 98.8% | | temporal | 2,379 | 79.8% | | **Overall adversarial** | **5,294** | **90.7%** | | Baseline | | 98.4% | | Delta | | -7.7pp | VERITAS is highly robust to negation attacks (99.7%) — a category where single-model verifiers typically drop 10-15pp. Temporal hedging (79.8%) is the documented weakness: no time-indexed knowledge retrieval. ## Datasets - eritas_oracle.synthetic_dataset — 770k synthetic claims - eritas_oracle.synthetic_dataset_adversarial — 5,294 adversarial variants with transform labels ## Schema claim_hash, claim_text, domain, ground_truth, source, confidence, consensus, flags, model_votes, trace_id, latency_ms, correct, ingested_at ## Citation ` @misc{lopez2026veritas, title={VERITAS: Independence-Weighted Multi-Model Consensus for AI Output Verification}, author={RJ Lopez}, year={2026}, doi={10.5281/zenodo.19187611} } `

提供机构：

VeritasOmega

5,000+

优质数据集

54 个

任务类型

进入经典数据集