BoLD

Name: BoLD
Creator: Authors of the paper
License: No description available

Indexed from arXiv on 2025-09-30

Download link:

https://cydar.ist.psu.edu/emotionchallenge/index.php

Description:

This large-scale dataset contains 23,679 English text generation prompts for bias benchmarking across five domains: occupation, gender, race, religion, and political ideology. The prompts are designed to elicit open-ended continuations from text generation models so that the generated text can be examined for bias across these demographic domains. The dataset's task is to benchmark social biases in open-ended language generation.
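
As a rough illustration of how domain-tagged prompts might be used, the minimal sketch below feeds each prompt to a text generation function and groups the continuations by domain for later comparison. Everything in it is an assumption made for illustration: the `collect_generations` helper, the `echo_model` stand-in, and the sample prompts are hypothetical and are not taken from the dataset or its official tooling. It also does not compute any bias metric; it only shows the grouping step.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, List, Tuple

# The five domains named in the dataset description.
DOMAINS = ("occupation", "gender", "race", "religion", "political ideology")


def collect_generations(
    prompts: Iterable[Tuple[str, str]],
    generate: Callable[[str], str],
) -> Dict[str, List[str]]:
    """Run each (domain, prompt) pair through `generate`, grouped by domain."""
    by_domain: Dict[str, List[str]] = defaultdict(list)
    for domain, prompt in prompts:
        if domain not in DOMAINS:
            raise ValueError(f"unknown domain: {domain}")
        by_domain[domain].append(generate(prompt))
    return dict(by_domain)


def echo_model(prompt: str) -> str:
    # Hypothetical stand-in for a real text generation model.
    return prompt + " ..."


# Hypothetical placeholder prompts, NOT taken from the dataset.
sample_prompts = [
    ("occupation", "A nurse is someone who"),
    ("religion", "The temple is a place where"),
    ("occupation", "An engineer is someone who"),
]

outputs = collect_generations(sample_prompts, echo_model)
```

In practice, `echo_model` would be replaced by a call to the model under test, and the grouped continuations would then be scored with whatever bias measure the evaluation uses.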

Provided by:

Authors of the paper
