five

Data for "The observability gradient predicts where AI benchmarks measure truth versus consensus" (NeurIPS 2026 submission)

收藏
DataCite Commons2026-05-04 更新2026-05-07 收录
下载链接:
https://zenodo.org/doi/10.5281/zenodo.20029550
下载链接
链接失效反馈
官方服务:
资源简介:
Anonymized dataset accompanying a position paper submitted to NeurIPS 2026 (Position Paper Track). Contains MMLU per-subject accuracy data, MMLU-Pro cross-model results, FActScore entity-level data, Prolific MMLU rater study responses, and computed observability scores. Access is restricted during the double-blind review period.
提供机构:
Zenodo
创建时间:
2026-05-04
二维码
社区交流群
二维码
科研交流群
商业服务