CUGE

Name: CUGE
Creator: 清华大学计算机科学与技术系
Published: 2022-06-14 15:19:35
License: 暂无描述

arXiv2022-06-14 更新2024-07-24 收录

下载链接：

https://cuge.baai.ac.cn/

下载链接

链接失效反馈

官方服务：

资源简介：

CUGE是一个全面系统的汉语理解和生成评估基准，由清华大学计算机科学与技术系等多个机构合作创建。该数据集包含21个代表性数据集，覆盖7种重要语言能力、18个主流NLP任务。数据集的创建遵循语言能力-任务-数据集的层次框架，旨在更系统地组织现有评估资源，全面反映通用语言评估需求。CUGE的应用领域广泛，旨在推动通用语言智能的研究与发展，解决现有评估基准的不足，如平面基准框架和过度简化的评分策略。

CUGE is a comprehensive and systematic Chinese understanding and generation evaluation benchmark co-created by multiple institutions including the Department of Computer Science and Technology of Tsinghua University. This dataset comprises 21 representative datasets, covering 7 critical language capabilities and 18 mainstream NLP tasks. Developed under a hierarchical framework of 'language capability - task - dataset', it aims to systematically organize existing evaluation resources and comprehensively reflect the requirements of general language evaluation. CUGE features broad application scenarios, and is intended to advance the research and development of general language intelligence, while addressing the shortcomings of current evaluation benchmarks such as flat benchmark frameworks and overly simplified scoring strategies.

提供机构：

清华大学计算机科学与技术系

创建时间：

2021-12-27

搜集汇总

数据集介绍