five

bethgelab/lm-similarity

收藏
Hugging Face2025-02-08 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/bethgelab/lm-similarity
下载链接
链接失效反馈
官方服务:
资源简介:
这是一个用于研究论文《Great Models Think Alike and this Undermines AI Oversight》的数据集。包含三个数据文件:judge_scores_mmlu_pro_free_filtered记录了九位评委在没有参考答案的情况下对经过筛选的开放式MMLU-Pro数据集的评分;judge_w_gt_mmlu_pro_free_filtered记录了五位评委在可以访问参考选项和真实信息的情况下对经过筛选的OSQ MMLU-Pro的集体评分;filter_mmlu_pro记录了在不需要访问参考选项的情况下可以回答的MMLU-Pro样本的每个样本的决策(粗略和精细)和索引。

This is the dataset for the research paper Great Models Think Alike and this Undermines AI Oversight. It includes three data files: judge_scores_mmlu_pro_free_filtered contains the scores of nine judges without access to reference answers on the filtered open-style MMLU-Pro dataset; judge_w_gt_mmlu_pro_free_filtered contains the ensemble scores of five judges with access to reference options and ground-truth information on the filtered OSQ MMLU-Pro; filter_mmlu_pro contains per-sample decisions (coarse and fine) and indices of samples in MMLU-Pro that can be answered without access to reference options.
提供机构:
bethgelab
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作