Auditor's Gambit: Metacognition & Social Benchmark
Source: Kaggle. Updated: 2026-04-16. Indexed: 2026-05-09.
Download link:
https://www.kaggle.com/datasets/spoorthipullal/agi-benchmark-v1
Description:
A benchmark dataset of 200 false-premise 'gambits' designed to audit model truth
Created:
2026-04-02



