Bias Benchmark for QA (BBQ)

Name: Bias Benchmark for QA (BBQ)
Creator: New York University
Published: 2022-03-16 09:35:45
License: No description available

Source: arXiv. Updated: 2022-03-16. Indexed: 2024-06-21.

Download link:

https://github.com/nyu-mll/BBQ

Overview:

BBQ is a hand-built dataset from New York University for evaluating social biases in question-answering (QA) models. It contains 58,492 examples covering nine social dimensions relevant to U.S. English-speaking contexts. The authors constructed each example around an attested social bias to test whether models systematically rely on that bias when answering. BBQ can be used to identify contexts in which a model's behavior may cause harm and to determine which types of bias need further study and mitigation.

Provider:

New York University

Created:

2021-10-16
