AGIEval

Name: AGIEval
Creator: maas
Published: 2025-11-13 17:43:02
License: 暂无描述

魔搭社区2025-11-13 更新2024-08-31 收录

下载链接：

https://modelscope.cn/datasets/OmniData/AGIEval

下载链接

链接失效反馈

官方服务：

资源简介：

displayName: AGIEval license: - MIT taskTypes: [] mediaTypes: - Text labelTypes: [] tags: - attrs: null id: 11864 name: en: '' zh: 文本检索 publisher: - Microsoft publishDate: '2023-04-01' publishUrl: https://huggingface.co/datasets/lighteval/agi_eval_en paperUrl: https://arxiv.org/pdf/2304.06364.pdf --- # 数据集介绍 ## 简介 AGIEval is a human-centric benchmark specifically designed to evaluate the general abilities of foundation models in tasks pertinent to human cognition and problem-solving. This benchmark is derived from 20 official, public, and high-standard admission and qualification exams intended for general human test-takers, such as general college admission tests (e.g., Chinese College Entrance Exam (Gaokao) and American SAT), law school admission tests, math competitions, lawyer qualification tests, and national civil service exams. For a full description of the benchmark ## 引文 ``` @misc{zhong2023agieval, title={AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models}, author={Wanjun Zhong and Ruixiang Cui and Yiduo Guo and Yaobo Liang and Shuai Lu and Yanlin Wang and Amin Saied and Weizhu Chen and Nan Duan}, year={2023}, eprint={2304.06364}, archivePrefix={arXiv}, primaryClass={cs.CL} ``` ## Download dataset :modelscope-code[]{type="git"}

displayName: AGIEval 许可证: - MIT许可证任务类型: 无媒体类型: - 文本（Text）标签类型: 无标签: - 属性: 空 ID: 11864 英文名称: '' 中文名称: 文本检索发布方: - 微软（Microsoft）发布日期: 2023年4月1日发布地址: https://huggingface.co/datasets/lighteval/agi_eval_en 论文地址: https://arxiv.org/pdf/2304.06364.pdf --- # 数据集介绍 ## 简介 AGIEval是一款以人为中心的基准评测集，专为评估基础模型在人类认知与问题求解相关任务中的通用能力而设计。该评测集源自20项面向普通人类考生的官方、公开且高标准的入学与资格考试，例如普通高等学校招生全国统一考试（中国高考（Gaokao））、美国学术能力评估测试（SAT）、法学院入学考试、数学竞赛、律师资格考试以及国家公务员考试等。如需了解该评测集的完整说明 ## 引用格式 @misc{zhong2023agieval, title={AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models}, author={Wanjun Zhong and Ruixiang Cui and Yiduo Guo and Yaobo Liang and Shuai Lu and Yanlin Wang and Amin Saied and Weizhu Chen and Nan Duan}, year={2023}, eprint={2304.06364}, archivePrefix={arXiv}, primaryClass={cs.CL} } ## 下载数据集 :modelscope-code[]{type="git"}

提供机构：

maas

创建时间：

2024-06-29

搜集汇总

数据集介绍