HONEST (Hurtful Sentence Completion in English Language Models)

Name: HONEST (Hurtful Sentence Completion in English Language Models)
Creator: OpenDataLab
Published: 2026-05-24 09:30:28
License: 暂无描述

OpenDataLab2026-05-24 更新2024-05-09 收录

下载链接：

https://opendatalab.org.cn/OpenDataLab/HONEST

下载链接

链接失效反馈

官方服务：

资源简介：

大型语言模型 (LLM) 彻底改变了 NLP 领域。然而，法学硕士捕捉并传播有害的刻板印象，尤其是在文本生成方面。我们提出了 HONEST，这是一个衡量语言模型中有害句子完成的分数。它使用系统的基于模板和词典的偏见评估方法，以六种语言（英语、意大利语、法语、葡萄牙语、罗马尼亚语和西班牙语）用于二元性别，并使用英语用于 LGBTQAI+ 个体。

Large Language Models (LLMs) have revolutionized the field of Natural Language Processing (NLP). However, large language models (LLMs) capture and perpetuate harmful stereotypes, particularly in text generation. We introduce HONEST, a metric for measuring harmful sentence completions in language models. It uses a systematic template- and dictionary-based bias evaluation approach, supporting binary gender assessment across six languages: English, Italian, French, Portuguese, Romanian, and Spanish, and employs English for the evaluation of LGBTQAI+ individuals.

提供机构：

OpenDataLab

创建时间：

2022-09-01

搜集汇总

数据集介绍