BiasBios
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/microsoft/biosbias
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是从Common Crawl数据集中提取的文本性传记,旨在研究职业分类中的公平性。此外,数据集中的性别信息是自动根据传记中的代词提取的。该任务的目标是深入探讨职业分类中的公平性问题。
This dataset consists of textual biographies extracted from the Common Crawl corpus, and is designed to study fairness in occupational classification. In addition, the gender information in the dataset is automatically extracted based on the pronouns within these biographies. The goal of this task is to conduct an in-depth exploration of fairness issues in occupational classification.



