AWAREEVAL

Name: AWAREEVAL
Creator: Lehigh University
Published: 2024-02-16 17:47:38
License: 暂无描述

arXiv2024-02-16 更新2024-06-21 收录

下载链接：

https://github.com/HowieHwong/Awareness-in-LLM

下载链接

链接失效反馈

官方服务：

资源简介：

AWAREEVAL数据集由Lehigh University创建，旨在通过包含二元、多选和开放式问题来评估大型语言模型（LLMs）在五个意识维度上的表现：能力、使命、情感、文化和视角。该数据集通过多种问题类型全面了解LLMs的行为，特别关注LLMs在理解自身作为AI模型身份、识别其能力和使命以及展示社会智能方面的能力。AWAREEVAL的应用领域涉及AI对齐和安全性，强调了在可信和伦理发展中LLMs意识的重要性。

The AWAREEVAL dataset was created by Lehigh University. It is designed to evaluate the performance of Large Language Models (LLMs) across five awareness dimensions: competence, mission, emotion, culture, and perspective, using binary, multiple-choice, and open-ended questions. This dataset provides a comprehensive understanding of LLM behaviors through diverse question types, with a particular focus on LLMs' abilities to comprehend their own identity as AI models, recognize their inherent capabilities and missions, and demonstrate social intelligence. Applications of AWAREEVAL cover AI alignment and safety, emphasizing the significance of LLM awareness in the trustworthy and ethical development of AI.

提供机构：

Lehigh University

创建时间：

2024-01-31

搜集汇总

背景与挑战

背景概述

AWAREEVAL数据集由Lehigh University开发，旨在通过二元、多选和开放式问题评估大型语言模型在五个意识维度（能力、使命、情感、文化和视角）上的表现，以全面了解其行为。该数据集特别关注LLMs对自身身份、能力、使命的理解以及社会智能展示，应用领域涉及AI对齐和安全性，强调LLMs意识在可信和伦理发展中的重要性。

以上内容由遇见数据集搜集并总结生成

5,000+

优质数据集

54 个

任务类型

进入经典数据集